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INTRODUCTION 


Until  about  1950's,  the  statistical  Inference  problems  were 
primarily  formulated  as  problems  of  the  estimation  of  parameters 
and  tests  of  hypotheses.  Estimation  problems.  In  general,  are 
decision  problems  with  Infinitely  many  actions  whereas  hypotheses 
testing  problems  are  two  actions  problems.  For  problems  of  comparing 
k populations  (k  >2),  usually,  more  than  2 actions  should  be 
considered.  Thus  It  Is  not  quite  realistic  to  treat  them  only 
as  hypotheses  testing  problems.  The  classic  tests  of  homogeneity 
were  found  to  be  Inadequate  In  two  respects.  First,  the  formulation 
Is  not  designed  to  answer  many  questions  which  are  of  real  Interest 
to  the  experimenter.  Second,  we  almost  always  reject  the  null 
hypothesis  which  says  all  the  parameters  are  equal  If  enough  data 
are  collected.  To  eliminate  the  shortcomings,  one  should 
formulate  the  problems  as  multiple  decision  problems.  Mosteller 
(1948),  Paulson  (1949),  Bahadur  (1950)  and  Bahadur  and  Robbins 
(1950)  were  among  the  earlier  researchers  to  do  so,  thus  laying 
the  groundwork  for  the  Investigation  of  selection  and  ranking 
procedures. 

'Indifference  zone'  approach,  proposed  by  Bechhofer  (1954)  ^2552!!Li?r.. 

ins 

Is  one  of  the  two  basic  formulations  for  ranking  and  selection  W 

Iiwwry  > 

problems.  In  this  approach,  a single  population  (or  a fixed  j$ti  v 


size  of  populations)  Is  selected  and  Is  guaranteed  to  be  the  one  of 
interest  with  probability  P*  If  the  parameters  lie  outside  some 
subset,  the  zone  of  indifference.  Another  basic  formulation,  which 
is  due  to  Gupta  (1956,  1963,  1965),  Is  the  'subset  selection'  approach. 
In  this  approach,  one  wishes  to  select  a subset  which  contains  the 
population  (or  populations)  of  interest  with  a minimum  probability 
P*  over  the  whole  parameter  space.  The  size  of  the  selected  subset 
depends  on  the  outcome  of  the  experiment  and  Is  not  fixed  in 
advance.  Using  these  two  approaches,  a large  number  of  contributions 
have  been  made.  A complete  bibliography  can  be  found  In  a forthcom- 
ing monograph  of  Gupta  and  Panchapakesan  (1979). 

Bayes  approach  for  selection  and  ranking  problems  has  also 
been  considered.  Recent  contributions  made  In  this  framework  are 
Hsu  (1977),  Gupta  and  Hsu  (1978),  Mlescke  (1978)  and  Kim  (1979). 
Bayesian  analysis  Is  attractive  If  a prior  distribution  for  the 
unknown  parameters  can  be  specified  exactly.  However,  It  Is 
often  that  one  can  only  have  partial  prior  Information.  In  this 
case,  the  prior  Is  restricted  In  some  sub-class  r of  all  prior 
distributions.  The  r-mlnimax  criterion  then  requires  the  use  of 
decisions  which  minimize  the  maximum  Bayes  risk  over  r.  Such  a 
principle  has  been  used  in  multiple  decision  problems  by  Randles 
and  Holland  (1971),  Gupta  and  Huang  (1975,  1977),  Berger  (1977), 

Miescke  (1979)  and  Kim  (1979).  The  first  two  chapters  In  this 
thesis  are  related  to  r-mlnlmax  rules. 

There  are  situations  where  a statistical  decision  problem 
occurs  repeatedly  and  Independently.  Then  frequently,  empirical 
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Bayes  approach  becomes  appropriate  for  consideration.  In  this 
approach,  one  assumes  no  prior  Information  about  the  parameters 
except  for  the  existence  of  a prior  distribution  t.  By  use  of 
this  empirical  Bayes  approach,  one  can  then  guarantee  that  the  rules 
one  uses  are  almost  as  good  as  the  Bayes  rule  with  respect  to  t for 
large  samples.  Empirical  Bayes  rules  for  multiple  decision  problems 
have  been  derived  by  Deely  (1965),  Van  Ryzin  (1970),  Huang  (1975). 
Van  Ryzin  and  Susarla  (1977)  and  Singh  (1977). 

Besides  the  comparison  of  k populations  among  themselves, 
sometimes,  in  practice,  one  wishes  to  compare  them  with  a control 
population  (or  a standard  population).  There  are  many  situations 
where  one  wants  to  select  populations  better  than  a control.  But 
there  are  other  cases  where  one  Is  interested  In  selecting 
populations  close  to  a control.  Contributions  related  to  these 
topics  can  be  found  in  Chapter  20  of  Gupta  and  Panchapakesan 
(1979). 

In  this  thesis,  some  results  about  the  r-mlnlmax  rules  and 
empirical  Bayes  rules  have  been  obtained.  In  Chapter  I,  a problem 
of  selecting  populations  close  to  a control  is  considered.  Under 
the  assumption  that  populations  are  normally  distributed,  our  goal 
Is  to  select  the  'good'  populations.  A '0-1 ' type  loss  is 
Introduced.  When  the  control  parameter  is  known,  we  derive  a 
r-mlnlmax  rule.  When  it  Is  unknown,  a restricted  r-minlmax  rule 
Is  derived.  We  also  find  Bayes  rules  and  minimax  rules  for  the 
unknown  parameter  case.  A comparison  among  these  three  rules  is 
made.  For  r-mlnlmax  rules,  we  show  some  optimal  properties  and 
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some  general  distributions  for  which  r-minimax  rules  can  be  found. 

The  problem  of  selecting  the  t-best  populations  is  discussed 
in  Chapter  II.  It  is  shown  that  If  the  populations  have  PF2 
densities,  then  the  natural  selection  rule  - which  selects  the 
populations  with  the  largest  t sample  values  - is  a r-minimax  rule. 
This  result  has  also  been  extended  to  the  case  where  the  populations 
are  not  necessarily  independent.  Also,  by  a simultaneous  selection 
of  the  t-best  populations  for  all  1 < t < k-1,  a r-minimax  rule 
for  the  complete  ranking  of  k populations  is  derived. 

Chapter  III  deals  with  a problem  of  selecting  populations  which 
are  'better'  than  a control.  Under  a linear  loss,  we  derive  a 
sequence  of  empirical  Bayes  rules  for  uniformly  distributed 
populations.  When  the  priors  are  assumed  to  have  bounded 
supports,  empirical  Bayes  rules  are  obtained  for  more  general 
distributions.  Based  on  Monte  Carlo  studies,  tables  are  computed 
for  the  smallest  sample  size  required  for  the  empirical  Bayes  rules 
to  be  'close'  to  the  true  Bayes  rules. 


* 


I 
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CHAPTER  I 

T-MINIMAX  PROCEDURE  FOR  SELECTING 
POPULATIONS  CLOSE  TO  A CONTROL 

1.1  Introduction 

Problems  of  selecting  populations  close  to  a control  arise  frequent- 
ly In  Inductrlal  production  such  as  to  match  parts  or  to  imitate  some 
popular  goods  in  the  market.  This  may  be  the  first  step  for  quality  con- 
trol, since  after  knowing  the  "good"  populations,  we  may  find  ways  to 
Improve  production  so  that  all  products  are  "exactly"  alike  to  a fairly 
good  degree  of  precision.  Thus  the  selection  problem  Is  Interesting  and 
challenging. 

Many  authors  have  considered  the  problem  of  comparing  populations 
with  a control  under  different  types  of  formulations.  Paulson  (1952), 
Bechhofer  and  Turnbull  (1974)  discussed  problems  of  selecting  the  best 
population  If  the  best  population  Is  better  than  the  control.  Dunnett 
(1955),  Gupta  and  Sobel  (1958)  considered  the  problem  of  selecting  a 
subset  containing  all  populations  better  than  the  control.  Lehmann  (1961), 
Randles  and  Hollander  (1971)  dealt  with  the  problem  of  selecting  popula- 
tions better  than  a control.  Bhattacharyya  (1956,  1958),  Tong  (1969), 
Seeger  (1972),  Huang  (1975),  and  Kim  (1979)  have  considered  partitioning 
a set  of  populations  with  respect  to  a control.  Non-pa rametr 1c  procedures 
related  to  some  aspect  of  the  problem  have  been  studied  by  Rlzvl,  Sobel 
and  Woodworth  (1968),  Purl  and  Purl  (1969).  However,  very  few  papers 
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have  been  devoted  to  the  discussion  of  selecting  populations  close  to  a 
control.  A.  K.  Singh  (1977)  considered  this  problem  and  derived  Bayes 
rules  and  empirical  Bayes  rules  for  Poisson,  Geometric  and  Binomial 
populations.  Except  in  rare  situations,  information  concerning  the  prior 
distribution  of  a parameter  is  likely  to  be  incomplete.  Hence  the  use 
of  Bayes  rules  is  hard  to  justify.  The  use  of  partial  or  incomplete 
prior  information  in  statistical  inference  has  led  to  the  development  of 
the  'so-called*  r-minimax  criterion,  a term  initially  employed  by  Blum 
and  Rosenblatt  (1967).  The  original  idea  of  r-mlnimaxity  is  due  to 
Robbins (1951 ).  To  be  more  precise,  although  we  may  not  know  the  prior 
distribution  completely  we  may  have  enough  information  to  specify  that 
the  prior  is  a member  of  a subset  r of  the  class  of  all  priors, 
r-minimax  criterion  then  requires  one  to  use  the  decision  rule  which 
minimizes  the  maximum  expected  risk  over  r.  It  is  interesting  to  note 
that  if  r contains  only  a single  prior,  then  the  r-minimax  rule  is 
just  the  Bayes  rule  for  that  prior.  At  the  other  extreme,  when  r con- 
sists of  all  priors,  the  r-minimax  rule  reduces  to  the  minimax  rule.  In 
this  chapter,  we  will  consider  the  r-minimax  decision  rule  for  selecting 
populations  close  to  a control  and  compare  it  with  the  Bayes  rule  and  the 
ninimax  rule.  In  so  doing,  it  will  be  shown  how  good  these  rules  are. 

In  Section  1.2,  definitions  and  notations  used  in  this  chapter  are 
introduced,  and  a decision-theoretic  formulation  of  the  problem  is  given. 
In  Section  1.3  and  Section  1.4,  we  derive  a r-mlnlmax  decision  rule 
for  both  cases  when  the  control  parameter  90  is  known  and  when  it  is 
unknown.  It  should  be  pointed  out  that  Randles  and  Hollander  (1971) 
considered  a r-minimax  procedure  for  selecting  populations  better  than 
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a control.  When  e0  Is  unknown;  they  applied  the  Hunt-Stein  theorem  to 
prove  the  r-mlnlmaxlty  of  their  rules  for  the  component  problem.  However, 
the  proof  given  by  them  does  not  justify  that  the  r-minlmax  rule  of  the 
component  problem  will  give  us  a r-minimax  rule  for  the  whole  problem. 
Mlesche  (1979)  gave  another  technique  which  can  be  applied  to  our  problem 
to  solve  for  the  r-minlmax  rule  when  e0  Is  unknown;  this  Is  done  in 
Section  1.4. 

In  Section  1.5  some  optimal  properties  of  r-minlmax  rules  are  found. 
In  Section  1.6,  we  generalize  the  results  of  Section  1.3  and  1.4,  and 
derive  r-mlnimax  rules  for  some  more  general  distributions  besides  the 
normal.  A r-mlnimax  rule  for  selecting  the  populations  with  large 
entropy  Is  given  as  an  example.  In  Section  1.7,  under  the  assumption 
that  the  prior  distributions  are  N(<M,  Bf2),  we  find  the  Bayes  rules 
and  we  also  find  the  minimax  rules.  These  rules  and  the  r-minlmax  rules 
found  In  Section  1.4  are  compared  In  Section  1.8  in  terms  of  the  Bayes 
risk,  the  maximum  risk  over  r,  and  the  overall  maximum  risk  for  all 
possible  choices  of  the  prior  distributions. 

Numerical  tables  are  given  for  selected  values  of  variables  for 
comparison  of  these  rules.  Finally,  In  Section  1.9, we  give  an  example 
In  which  we  apply  the  optimal  selection  rules.  Conclusion  about  the 
robustness  of  each  rule  discussed  in  this  chapter  are  also  given  in 
Section  1.9. 

1.2  Notation  and  formulation  of  the  problem 

Let  JIq,  JIr...,nk  be  (k+1)  Independent  normal  populations  with 

p 

means  0q,  ^,...,9^  and  common  known  variance  o , respectively. 
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nQ  Is  the  control  population,  the  other  populations  are  defined  as 
good  or  bad  by 

Definition  1.2.1.  Let  A > 0,  e > 0 be  two  given  numbers,  then 
(1)  Population  Is  good  Iff  (0Q-0J  S A 

(11)  Population  is  bad  iff  l9O'0il  - A + e. 

Note  that  we  do  not  define  ll^  as  good  or  bad  if  A<  |0.-9^|  < A + e , 
which  allows  us  to  regard  it  as  an  indifference  zone  between  good  and 
bad  populations.  Throughout  this  chapter,  A and  e will  be  assumed 
given  and  fixed.  We  are  Interested  In  selecting  as  many  as  possible 
good  populations,  and  rejecting  as  many  as  possible  the  bad  ones.  We 
formulate  this  problem  in  the  framework  of  multiple  decision  theory. 

Let 

/ (9*  (9q.01.....9|()  I -•  < 9j  < 00  for  all  1 = 0,...,k} 


parameter  space 


if  9q  is  unknown 


(0*  (0^ . . ,0k)  | -<*>  < Qj  < ® for  all  1 ■ 1 . ,k) 


if  0Q  is  known 


Let  eG(i)  = (0  | |0reol  - A*»  ®b(1)  = t®  te  | l0i-Qol  * A + e). 

Let  Xn,  X^,...,  Xin  be  the  observations  from  (0  < i < k).  Since 

= l ^ X.j  is  the  sufficient  statistic  for  0i  and  {X^q  are 

independently  normally  distributed  with  means  6n,  0,,...,  e.  and 

2 U | K 

common  known  variance  ( = ^_  ),  so  without  loss  of  generality  (wlog) 


9 


we  can  assume  that  there  Is  only  one  observation  X.  from  each  population 
n,.  Then 

V 1*0=  0-  e-^IX-9')2 

/?n  o 

and 


X - 

The  sample 

r 

X-  i 

k 


{(Xq,X| ,Xk)  for  0q  unknown 

(Xj i. . . |X^)  for  0g  known. 

space  X Is  defined  as  follows: 

(Xs  ( Xq * » . . • ) | -•  < x<  < ® 

If  0g  is  unknown 

(8  * (X^*...»X^)  | -oo  < Kj  < oo 

if  0g  is  known. 


\ 

for  all  i*0,l 


for  all  i*l . . 


..k} 


.k  } 


Let  (X,&) 


be  the  usual  Lebesque-measurable  space.  Let 


D ■ {6  * (<Sj,...t6^)  | 6^  : X [0,1]  is  a measurable  function, 
for  all  1 s i i k }. 


Then  0 is  the  set  of  all  selection  rules  and  6^ (x)  is  the  probability 
of  selecting  It^  when  we  observe  X * x. 

Let  Lj  denote  the  loss  incurred  when  we  fail  to  select  a good  popula- 
tion and  L2  the  loss  for  each  bad  population  selected.  We  define  the 
loss  L(e,6)  of  using  selection  rule  6 when  0 is  the  true  state 
of  parameter  as  follows: 
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Definition  1.2.2.  L(0,$(*))  = z L(l)  (0,$.(x)) 

1=1  * 1 ~ 

IL1(1-61(s))  if  0 € 0g(i ) 

L2  6.(x)  if  0 € 9 B ( i ) 

0 otherwise. 

(1.2.1) 

Finally,  we  will  assume  that  our  partial  information  is  that  has 
probability  Ai  to  be  good  and  probability  AT  to  be  bad.  Also,  A^ 
and  A 1 are  known  to  us  with  0 i A. , A j 4 A^  + AT  < 1 . 

Definition  1.2.3.  r = {t  | t is  a prior  distribution  on  8 & 

pxfeG(i)J  = pxCaB(i)]  = A'  , for  all  0<i<k> 

0.2.2) 

One  can  see  that  r is  the  class  of  all  possible  prior  distributions  on 
9 which  summarizes  our  information  about  0Q,01 , . . . ,ek.  Let  denote 
the  Lebesque-Stiel jes  measure  corresponding  to  t,  then  for  any  Lebesque- 
measurable  set  A Pt[A]  = jA  dt(0). 

Definition  1.2.4.  For  all  t € r and  g € D,  we  define 
r(t  ,6)  = Et[R(g,5)] 

where  0 is  a random  variable  distributed  as  T(g)  and  R(0,5)  = 
Eg[L(0.6(X)] . 

A rule  6*  € D is  said  to  be  a r-mlnimax  rule  iff 

sup  r(t,6*)  = inf  sup  r(T,<$). 
t€T  ~ 6€D  t€T 
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Definition  1.2.5.  For  any  i (1  < 1 < k)»  the  i-  component 

problem  is  to  treat  the  above  problem  as  if  we  only  pay  for  the  loss  for 
wrong  decision  about  . Hence,  the  i—  - component  problem  is  only 
concerned  with  (0,{6.(x)},  L^).  Similarly,  we  will  use 
R(l)(e,fii)  = Eg[L(i)(§,6.(X)]  and  r(i)(x,6.)  = Ex[R(i)(0,6.)]  to 
denote  the  risk  of  1—  - component  problem.  We  see  that 

= R^((M.j)  and  r(x,6)  = E r^(x,6.).  This  suggests 

that  in  order  to  find  the  r -minimax  rule,  we  may  treat  the  i—  - 
component  problem  separately.  In  the  next  section,  a r-minimax  rule 
is  derived  for  the  case  when  0Q  is  known. 


1.3  Derivation  of  a T-minlmax  rule  when  0Q  Is  known 

In  this  section,  0q  is  treated  as  known.  We  consider  the  i— 
component  problem  first. 


Lemma  1.3.1.  Let  <5^(x)  be  an  i—  - component  decision  rule,  if 

ee^D  Ee[V-X>]  = 


0®b(1)  = * E6i=0o-A-e^6i^^  * 


then  for 


r0(i)  = (x€  r|  pt[0.=0o+a]  + pt[01=0o-a]  = x1 

and  PT[0i*0o+A+e]  + PT[0i=0()-A-e]  * X'}, 


we  have 


sup  r^1^(x,6i)  = r^(To*V  for  all  tq  € rQ(1). 


(1.3.1) 
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Proof:  v ter, 

r<,,(T,Si)  ’ JsG(t)  tgM-VW  <M«) 

* Jee(l)  Ee[4  V*»  *<«> 

5 L'Xj  ' L’X(  (^fG„,  W*»>  * W -Jf,SB,(Pl) 


* Vi  - Liyei  ■ eo  * ^ Ee,.e0+4Csi(*)3 


- llpt0t«i-V43  Eet-e0-a 

+ L2PT0[9reo+Att5  Ee,.e0+lt£ 

+ Vt/ VV4^  «,«)] 

* |esM)  Esh  <'-{i(«»1  <*„<!>  + L (1)  E8a2«,(x)]  <h„(e) 


= r(l^  (tq,5.). 


The  following  lemma  has  been  widely  used  to  solve  for  the  r-minimax 
rule.  It  is  stated  here  without  proof. 

Lenina  1.3.2,  (Randles  and  Hollander  (1971)) 

If  there  exists  a prior  distribution  r*  € r such  that  the  Bayes  rule 
6*(x)  for  the  i—  - component  problem  wrt  t*  satisfies 
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sup  (t*,6J)  for  all  1 3 1,2,. ...k. 

then  §*  3 ($*,. . . ,«S*)  is  a r -minimax  decision  rule. 

Combining  Lenina  1.3.1  and  Lenina  1.3.2,  we  get  the  following 
theorem: 


Theorem  1.3.1.  If  for  1 3 1,2,...,k.  <S*(x)  Is  a Bayes  rule  for  the 

i ~ . 

1L  *' 

1—  - component  problem  wrt  the  same  prior  distribution  t*  e O I'Q(1) 

* * * isl 

and  assume  that  6^  satisfies  (1.3.1),  then  <$  = (6j,...,s£)  Is  a 

r-mlnlmex  rule. 


Proof:  Since  t € rQ(1)  for  all  1,  hence  by  Lemma  1.3.1, 

(1.3.1)  >*0  sup  r^  (t ,<5?)  3 r^  (t *,6?)  for  all  1. 
x€r  1 1 


Then  by  Lemma  1.3.2, 


* . * 


6 3 (61,...,6k)  is  a r-mlnlmax  rule. 


Remark:  For  Lemma  1.1.2  and  Theorem  1.3.1  to  hold,  we  do  not  need 
to  assume  that  the  populations  are  normally  distributed.  But  to  satisfy 
condition  (1.3.1),  we  will  restrict  ourselves  to  normal  populations 
from  now  on.  Some  results  for  general  distributions  will  be  discussed 
In  Section  1.6. 


To  verify  (1.3.1),  some  tools  which  transfer  the  wonotonlclty  of 
functions  on  X to  the  nonotonlclty  of  function  on  e are  needed.  We 
quote  some  definitions  and  theorems  from  Karlin  (1968). 
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Definition  1.3.1,  X - fQ(x)  is  said  to  be  TPn 
order  n)  Iff  for  any  9,  < e2  < ...  < en,  < x 

have 


(Totally  Positive  of 

2 < *•*  < V ** 


for  all  1 < m < n. 


V*l> 

fe(x,) 

m 


V*»> 


W 

m 


> 0 


S^nitton  '-3-^  » «-f9(x)  Is  TPn  for  all  n - 1.2 

X is  said  to  be  TP  (Totally  Positive). 


then 


^"a  u'3'  lf  fe!x)  - a(e)  b(x)  ea(9>6<’l>,  .here  a(e)  > 0. 

t>(x)  > 0,  and  a(9),  6(x)  are  Increasing  functions,  then  f.(n)  Is 

9 ' 

TP. 

Deflation  1.3.3.  For  any  real-valued  function  h,  let  S(h)  denote 
the  number  of  sign  changes  of  h;  Me  define  S(h)  « n Iff  there  exist 
Xj  < x2  < ...  < xn+1  such  that  either 

(-1)J+1  h(Xj)  > 0 v j = l,2,...,n+l 
or 

(-l)j  h(xj)  >0  V j = l,2,...,n+l, 

but  for  any  y1  < y?  < ...  < y^,  the  above  two  inequalities  do  not 
hold. 


• i: 
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Theorem  1.3.2.  (Karlin)  Variation  Diminishing  Property 

If  X - f0(x)  Is  TPn  and  h Is  a plecewlse-contlnuous  function.  Let 

g(9)  - E0[h(X)],  then 

S(h)  < n - 1 ~>  S(g)  < S(h). 

Furthermore,  If  $(g)  ■ S(h)  ■ n - 1,  then  g and  h change  signs  in 
the  same  order. 

Corollary  1.3.1.  If  h(x)  ■ (x)  where  I is  the  Indicator 

function  and  X - fQ(x)  * a(e)  b(x)  with  a(e),  e(x) 

increasing  in  e,  x,  respectively,  then  if  g(e)  * En[h(X)]  and 
g(eQ+e)  » g(eQ-e)  for  some  eQ,  we  have  g is  Increasing  for  0 < eQ 
and  decreasing  for  e > e^. 

Proof:  X TP  (by  Lemma  1.3.3)  X is  TPj. 

Now,  for  0 < c < 1,  let 

hc(x)  * h(x)  - c and  gc(e)  » g(e)  - c, 

then 

EQ[hc(X)]  - gc(e). 

Since  S(h£)  * 2 < 3 - 1,  we  get  S(g^)  < 2 (by  Theorem  1.3.2) 

VO  < c < 1.  Now,  if  g is  not  increasing  for  0 < 0q,  then  there 
exists  e1  < e2  < eQ  and  g^)  > g(02).  Let 
0'  ■ 20q  - 01  and  ©2  - 2eQ  - e2, 

we  get 

9(8f)  ■ g(61 ) and  g(0|)  ■ g(02),  but  ©2  < 0'. 

Now  let 

c0  ■ \ Cg(©1 ) + g(e2)),  then  0 < c0  < 1. 


Figure  1.  Number  of  sign  changes  of  function  g. 


As  we  can  see,  S(gc^)  > 2 =*>  S(gCQ)  * 2.  But  then  by  Theorem  1.3.2, 
gCo  should  change  sign  in  the  same  order  as  h£  does,  which  is  not 
true  for  g£  and  h£.  This  completes  the  proof. 

Remark:  For  Corollary  1 .3.1  to  hold,  we  only  need  X to  be  TP,.  But 
since  we  will  consider  the  distributions  which  are  mainly  TP,  so  we 
have  this  stronger  assumption  Included. 


6(x)  * t+e0](x>  for  some  t > 0.  9(6)  * E0[6(X)], 

then  we  have 

g(e)  is  increasing  for  e < 0Q  and 
g{9)  is  decreasing  for  0 > 0Q. 


Proof:  Let  Z - N(0,a2),  then  Z ~ -Z. 

Now,  g(0+0o)  = Pr[-t  + 90  S Z + 9 + 0Q  s t + 0O] 

= Pr[-t  -6<Z<t-0j 

= Pr[-t  +6s-Zst+6] 

= Pr[-t  + 05Zst  + 0] 

■ M-t  + 60  S Z + 0O  - 0 £ t + 0q] 

3 9(90  - 0) 
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then  by  Corollary  1.3.1,  we  proved  g(o)  Is  increasing  for  0 < o0  and 
decreasing  for  0 > 0Q.  This  completes  the  proof. 

Note  that  Corollary  1.3.2.  Is  Important  for  us  to  justify  condition 
(1.3.1).  Now,  we  turn  to  the  main  theorem  of  this  section. 


Theorem 


1.3.3.  Let  X,j  ' NfOj.o'j  for  1*1,2 k be  independent 


random  variable  with  known.  If  §*  « (6*,...,6^)  where 

W " I[-t1  ♦ 0O,  e0  ♦ t^  (x1J 

and  +tj , are  determined  by  the  equation 

i2y,M  ) * ♦(  ^ )] 


■ ^ > * ♦<  tJir  m . 


(1.3.2) 


then  Is  a T-mlnlmax  rule. 


Proof;  We  define  t to  be  the  prior  In  r such  that  01 0k 

are  Independent  and  satisfy 

P x*[®i  • «0  - A - e]  ■ PT*[«1  ■ eQ  ♦ A ♦ e]  ■ y 
pT*[ei  " flo  - • PT*[ei  ■ e0  ♦ A]  - y 

PT*c®^  " eo  * A + 7 3 ■ 1 ■ ^ » 

* k 

for  all  1 ■ l,2,...,k.  Then  It  Is  easily  seen  that  t € n rn(1). 

1-1  u 

Now,  let 

V«>  ‘ 


where 


fe1(xl)  “ l *(  ^ >• 


then  we  have 


Ai)  (T*-61>  “ Jo  Jx  L(1)(e.«i<?»  ffi(s)  dx  dr*(8) 


1 1 e1  “e0 1 -A  f9^  d*  dT  ^ 

+ | |Oj-0o|iA+e  jx  L2  M2*  d*  dT  ^ 

L 1,0-64(8))  E f fl(x)  P *(G)  d* 

J*  1 1 eete^eQ-A)  8 T 

♦ f L. (1-64(8))  2 Ux)  pT*(e)  dx 

Jx  1 1 ge^-e^A)  9 T 

+ L UMx)  z f6(x)  PT*(e)  dx 

•|X  1 1 ft€{ef»0o-A-e)2  T 

+ L M<(x)  1 ffl(»)  PT*(o)  dx. 

JX  2 1 ^(e^GQ+A+e)  H ' 


Now  we  may  notice  that 

E M»)  PT*(0) 

0e{0-(=Oo-A}  " 

Ti 


fe0-A(Xi)  ^ 


Qt { 0 ^ =0q- A-e } V**  Px*W 
f0n-A-e*X1*  ^r 


e fB(g)  PT*(e) 

fli(9t»G0»A)  9 T 

f0o+A^X1^  "J" 


E M*)  PT*(§) 

gtee^So+A+e)  9 T 

fe0+A+c*xi*  T 


Any  of  the  above  four  expressions  Is  denoted  by 


c(x1 9 • • • »x  9*  • • 9 x |^ ) • Hence  9 


20 


Since  cosh(y)  . cosh(-y).  w SM  ^ 

MW  ■ W*(). 


cosh 

.aT  (xi-00)(A+e)~l 

cosh 

0.3.4) 


Moreover, 


_1  cosh  ax  5 

dx  cosh  bx  * 


cosh^  S,nh[(-b»*3- 
Hence,  for  a > b > o cosh  ax  . 

If  X > o Thi  . ’ S55rEr  5 *S'"9  f0r  * < 0 »"«  Increasing 

« 0.  This  shews  that  h,(,,>  „ decreasing  for 

increasing  x(  > e„.  Hence  It  fe.lews  that  , ° 

has  two  solutions,  + t + n Th^-  « ,,  * 

* - ci  + 90"  The".  f™n  0.3.3), 

1 if  h(xt)  s 1 

0 " h(xt)  > l 


6*(*)  = fifa) 


r 


1 1f  m*i  + eo  $*i  * e0  ♦ t, 

by  0.3.4) 

0 otherwise 

and  tt' s satisfy  (1.3.2). 

y'  1Ct  then  from  Corollary  1 3 2 

9(VV  ■ »«W  and  9 is  Increasing  for  e < e .M  ’ 
for  e > a tm  1 0O  and  Creasing 

* 1 °0*  Th1s  Proves  that 


and 


«»"«)  9(ei>  ■ »<V4>  ■ ><V4> 


eeejfi)  9(V  ’ *<V4*>  ■ 9(e0-e-c), 
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f.e.,  condition  (1.3.1)  Is  satisfied.  Now,  Theorem  1.3.1.  shows  that 
6*  ■ (6j, ...,«£)  Is  a T-minlmax  decision  rule.  This  completes  the 
proof. 

Note  that  It  may  happen  (1.3.2)  does  not  have  a solution  for  some 
Aj,  A^,  A,  c.  In  this  case,  the  r-minimax  rule  implies  that  all  popula- 
tions are  bad. 

The  solutions  t^  depend  on  A^  and  AT  only  through  their 
ratio  v.|  = A../  AT  (see  (1.3.2)),  hence  6*  Is  actually  a r-minimax 
rule  for  r » (t|  PtC»6(D3  / PT[aB(1)]  * for  1-1,2 k). 

1.4  Derivation  of  a restricted  r-mlnimax  rule  when  9g  Is  unknown 

In  this  section,  9q  will  be  treated  as  an  unknown  parameter.  As 
mentioned  in  Section  1.2,  8,  X,  and  X will  Include  one  more  component 
and  one  observation  XQ  is  taken  from  nQ. 

Definition  1.4.1.  Let  D|  c D be  the  class  of  rules  such  that  the  1— 
decision  rule  depends  only  on  XQ  and  X^,  i.e., 

0]  a * (<51....,fi|()  € D ( 6^  * (xo,x1 ) v 1 * 1 S k). 

0.4.1) 

We  derive  a r-minimax  rule  in  the  class  Dj.  The  problem  whether 
our  rule  is  also  r-minimax  in  D when  9q  is  unknown  is  not  solved. 

Ferguson  (1967,  P.  90)  gives  two  theorems  to  provide  solutions  for 
the  mlnlmax  rule.  Lemma  1.3.2.  is  similar  to  his  Theorem  1 to  solve  for 
a r -mlnlmax  rule,  and  the  following  lemma  (due  to  Miescke)  is  similar 
to  Ferguson's  Theorem  2. 
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l^naj^  (Miescke)  If  {t„  € r£,  Is  a sequence  of  priors  and 


: th 


in  n=l  ’s  a se<Juence  of  Bayes  rules  corresponding  to  r for  the 


v--  component  problem  for  all  1 = 1,2 k,  then  5°  = (6° 6°) 

Is  a r-minlmax  rule  iff 

limjnf  r(()(Tn.j“n)  > suf  r(t>(t.5»)  for  .11  k. 


(1.4.2) 


Proof:  Let  6 = («| 6fc)  be  any  selection  rule.  Then 


sup  r(i  ,6)  = sup  I r^(T,6.) 

ter  t€r  1=1  I7 


.0), 


k 

2 S if,  rU'<T-.'5.>  * «*, 

2 $ j,  r<<)(T-s?)  ■ ”p  r^°>  ■ 

This  proves  6°  Is  a r-minimax  rule. 


To  use  the  preceedlng  lemma,  we  need  to  find  a sequence  of  priors 
and  their  rules  so  that  (1.4.2)  holds.  Now.  each  prior  distribution 
t on  &0  xe,  x ...  xak  can  be  specified  by  the  marginal  distribution 
T0  on  eQ  and  the  conditional  distribution  a)0  on  x ...  x@k 
given  0Q  = oQ.  We  will  use  r = (T0,a>e  ) to  denote  such  prior 
distributions.  Let 


where 


Tn  a ^Tn’%^ 


(1)  Tn  Is  the  marginal  distribution  on  eQ  which  is  assumed  to 
be  uniform  over  [-n,n]. 

( 1 i ) Given  Og  * 0g.  l.e.,  under  , 0^,...  ,9^  are  independent 
and 

\0  [9i  “ V A ' e |0O]  * P<^90[ei  = °0  + A + £l  90]  = T 
Pu,90  ^9i  * 60  ' A I V = Po)QQ^®i  = e0  + *!0O]  = T 

Pw  ^0i  0O  + ^ + ? I ®(4 
°0 

We  will  also  use  the  notation  to  denote  the  conditional  marginal 

0 

distribution  of  9{  under  ua  . 


Lenina  1.4.2.  Let  tp  be  defined  as  above,  then  under  the  loss  function 

as  defined  by  (1.2.1),  the  Bayes  rule  in  the  class  D,  wrt  t for  the 

I n 

1—  - component  problem  is 


51n^x0’xi^ 


1 iff 


ff  ^ Ln[fVA+e(xi)+f0o-A-e(x1)]f9o(xO)d0O 

" XiL]  j_n^feo+A^Xi^+f0o-A^xi^f0o^xO^d0O 


(1.4.3) 


where  f0(x)  * 4-  $(  ~ ). 


croon  r •l'„-‘i''j_n[j|„1.e0|<4L,('-E(60,e1)^(*o'xi”>'1“9(;(9t 


L2E(0o.e, )t<S1  <x0’xl >]d“6gfe1  ^ dW 

“ fHU.n[^''(1'S(<X0’Xl')^f80*4*xl'  * V4<Xt)lf60<Xo) 


* T 4Mx0-xi>[fe0*A«<xi>  + fe0.4.e(x 


,)]]«» 


(0O)dxidxO 


* f.fJ.r,T  L.Vl>  * dTn<e0>dx1dxC 


* r.£,6i(xi,N)[41  |.n[fe0+A*c(x()  * f90-A-E(xi):|fe0(xo)Sid8o 


- -V-  l.n[Va<Xf)  * V4(x')%(x°)^d9oj<IXidx< 

Hence,  the  Bayes  rule  is  as  shown  in  (1.4.3). 

Now  we  find  the  Bayes  risk  of  6?  wrt  is 

in  n 


+ VVX 


(i ) ri  f°  r°°  [ n A . L,  rn 

r (,",6<")iJ4[1-{i"(xo-xi,M.„[V4(x,,*V4(Xi)],eo(X(>,'IT''(e«> 

* 4li/Vx1*  T~  |_n  ffo0»a+c^x)  *tfe0-A-Jx!^fo0(xo)dVeo|jdxidxo 
t I Jf9„*A*e*Xt**fe„-A-£*X1^f9 JX0)dV vi  dxidxo  • 
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If  we  consider  the  change  of  variables  f xi  nyi  + y0 


x0  = "yi  ' y0 


of  the  two  outside  integrals,  then  change  eQ  = ny.  - n0  for  the 

I 3 (xi >xq)  I 

inside  integral.  Since  | jfy—yj  I = 2n  and  f0(x)  = f0(x-o), 


we  get 


m n 0000  xiLi  n(y,+1) 

r (Tn’6in)  = / / m*n  ( 2~  / [f0(y0+n0"A)+f0(Vn0+A)] 

-®  n(y..-l ) 


fo^no’yo^  dn0’ 


x\l9  n(y.+l) 

T^/  , . [f0(VVA"e)+f0(Vn0+A+e)]f0(Vy0)dn0}dyidy0 

) 

® 1 X-L,  n(y.+l) 

if  / mint  2~/.  Jfo^yo+no"A^+fo^yo+no+A^fo^no"yo^dno’ 

-*  -1  n(yi~  I ) 


\\ |U  n(y.+l) 

— T- \ . . Jfo(yo+VA'e)+fo(yn+nn+A+£);|f 


nty^l) 


0^0  "0 


Fo(r,o'yo)dno}dyidyo* 


(1.4.4) 


To  find  a lower  bound  of  lim  inf  r(x  ,6?  ),  we  need  to  use 


the  following  facts: 


(i)  Fatou's  Lema. 


(ii)  — fn(— ) = / fn(x-z)f0(x-b)dx,  where  fQ(x)  = — r~  e 2° 
fl  0 Jl  — 0 u ^To 

From  (1),  (ii)  and  (1.4.4),  we  get 

/i  i a 00  1 X ^ L-,  2yA-A  2y/\+A 
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mm 


and 


sup  g(nJ  * g(A+e)  - g(-A-e) 
|n1 | l A+c 


inf  g(n.)  = g(A)  = g(-A). 

1 ^ A » 


hi  | < a 


NOW,  VT  = (Tn,u>  ) € r, 
u oQ 


r(i)(T,sO)  = / {/  L (1-E  [«?(xrx0))K  <ei> 

--  IV0OI  - A 10  10  0 1 

+ f i L2Ee.-e  [‘s?(xi"xo^daio  (ei»dTo<°o> 

I ei -oQ  l 1 A+e  c i 0 1 u °o  1 u u 

< Ll(l-  inf  E C6°(Xi-x  )D)/  ( dJ  (o  )dT  (o  ) 

I0r0OllA  1 0 1 1 0 -1e1-0o|<A  0o  1 0 0 

+ E2 1 E0  -0  (X<“X0^/  / dtaig  (0i,)liTn(0n) 

^Io.-OqI  > A+e  °0  1 1 U 10.-0O|  > A+e  0O  1 0 0 


= L2X.(1-  inf  g(nJ)+l?x!  sup  g(nJ 
|n^  | £ A |r)j  | 21  A+c 


= LlXi(1.  a(A)+|(.-A))  + l2x,  glA.nj+g(-A-e) 


Vi 


LoX] 


^0-EA[6'/(Y1)]+l-E.A[6!f(Yi)])+  -2/(EA+c[A0(Y.)]+E.A.t 


[«?(Y.)]) 


’1XJ1 


o o 


• E.x,  y,-A  y.+A  , L,x:  y,-A-e  y4+A+e  , 

■ / m i n f-_2~  ^ ^ ( “oT1 — )+*(“^T~ )]  “2 — [♦( — ^ )+4>( — ^-r — )]  ^*r}dy. 


^ ■ — ■ 
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~ L,\.  y^-A  y>A  l?Xl  y^-A-e  y^+A+e 

- / min(-lj-[fn(J~)+fn(— U],  -i_i[fn(J-_)+fn(-L_)])dy 


in 


n 


n m 


n 


n 


< 11m  Inf  ^(in»A^n)  by  (1.4.5). 

n -*■  «* 

Hence,  we  have  proved  that  sup  r^(T,<s®)  < l^mjnf  r(xn,6^n)  *or  all 

00  0 

i 3 1,2,. ...k.  By  Lemma  1.4.1.,  we  conclude  6 ■ (6^, ...,<$£)  Is  a 
r -minimax  rule  In  . This  completes  the  proof. 

By  the  same  reason  as  mentioned  In  the  last  paragraph  of 
Section  1.3,  6°  is  also  a r-minimax  rule  in  Dj  for 

r = HPT[0G(i)]/PT[eB(i)]  * vi , for  all  i = 0,1,.. ..k}, 

where  3 X^/Xi  for  i 3 0,1, ...,k. 


1.5  Optimal  properties  of  the  r-minlmax  rule 

As  mentioned  in  Section  1.2,wlog  we  can  reduce  the  sample 
size  to  1 for  each  population.  If,  in  fact,  we  observe  X^,...,X 


In 

from  n^,  the  r-minimax  rule  remains  the  same  with  the  substitution 

1 n 

of  X.j  by  R-n  (X.n  3 - l X..).  We  now  prove  the  following  theorem. 

j=l  J 

When  Oq  is  known, 

11m  Inf  Efl[S*(Ln  )]  3 1 

n—  |ere0|  < a ! 1 in 


Theorem  1.5.1. 


and 


1 ini 


SUP  EeC'sT(Xin)J=  °* 


n 1 0 ^ -0 q | ^ A+e  - 


The  above  theorem  says  that  as  n becomes  sufficiently  large,  the 
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probability  of  selecting  a good  population  approaches  1 uniformly, 
and  the  probability  of  selecting  a bad  population  goes  to  0 
uniformly. 

To  prove  Theorem  1.5.1,  we  need  the  following  lemma. 


Lemma  1.5.1.  For  any  sample  size  n,  the  r-minlmax  rule  f>*  = 
(6f  can  be  written  as 


«?<*(„ 


1 ' I[-ej(n),tj<n)](;tn-°0)> 


then 


11m  t. (n)  = A + j . 
n-*»  L 

Proof:  From  the  proof  of  Theorem  1.3.3.,  we  know  t^ (n)  is  the 
positive  root  of  the  equation 


hr(x)  ‘ [7X7  e 


. - %(2A+e)e 

L2A1  _ 2aZ  2 1 

h[-^  ax] 


= 0. 


cosr 


Now,  consider 


u»:  - 


f (X)  = XI  e 20 
Tn'x'  L1Xi  e 


- 1 


AX 


u»:  - h ^ >*'>x 


g(x)!Pe  2° 

L1xi  e 


n A v 

^ AX 


- 1. 


2e 


Because  for  x > 0,  we  have 
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P?(A+c)x  -n7(A+c)x  - n_.(A+e)x  n7(&+c)x 


+e 


T 


AX 


2el 


^jr  Ax  * AX 
e°  +e 


--  n 

T 

_0 


AX 


so  gn(x)  < hn(x)  < f „( x ) for  x > 0.  Let  r^(n)  and  s^(n)  be  the 
only  positive  roots  of  9n(x)  * 0 and  fn(x)  * 0,  respectively,  we 


get 


but 


rj(n)  > t^(n)  > s^n), 


l2x; 


r,(n) 


- 7(2a+c)e-  t— ) 

2 a 1 


and 


n 

Tc 

O 


ne 


s((n) 


-^(^A+cJe^,^)  ok-V) 

2o  L1X1  _ e L1X 1 

H A+  J ■ 


ne 


hence. 


1 ini  r.(n)  * lim  s.(n)  * A+  £ for  any  L, , L?,  x.,  X',  o,  6,e. 
n«.  n-^>  c 1 i 

So,  lim  t^(n)  * A+  4,  which  completes  the  proof. 

n-K* 

Now  we  give  the  proof  of  Theorem  1.5.1, 


Proof  (of  Theorem  1.5.1.): 

2 

Now,  Xjn  v N ( o i , ~).  For  |0j-0q|  < A,  let 


g(e 


t>  ■ Eo,[{*(Stn”  - Prt-*1<">  i V90  i *1 


(">] 
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.(V-K-i)  ,(-ti1")Vt) 

<s/Jr\  o/^n 


If  we  recall  that  g(e>0Q)  = g ( ©q-0 i )»  ancl  9 is  decreasing  for 
e.  > en,  then  inf  g(e.)  = g(en+A)  = g(en-A).  Hence, 

|01-9oI^a 

t,(n)-A  -t . (n) -a 

inf  E [«t(Xin)]  = *(-! ) - *(—4—). 

|0j-eo|  £ A 9i  a/^n 


lim  inf  E )]  = $(o°)-<t(-°°)  = 1. 

n-~  |ere0|  < a ei  1 in 

Similarly, 

sup  g(ej  = g(en+A+e)  = g(eft-A-e). 


Hence, 


t-(n)-(A+e) 

sup  E [6$(X.  )]  = lim[*(— ) 

1 9 i -9q  I £ A+e  i n-»®  a!  Jx\ 


-t,(n)-(A+e) 

-♦(— )] 

o/vTT 


= 4>(-oo)-$(-oo)  = 0. 


This  completes  the  proof  of  Theorem  1.5.1. 


Remark:  If  0q  Is  unknown.  Theorem  1.5.1.  becomes 

<"[ 

n-*«>  1 o -j  “®q  1 £ A i 0 


and 
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lim  , sup  Eo  -a  m 0 . 

n-*«  le^-^Q ( >A+e  G1  90  1 1n  0n 

The  proof  is  similar  to  that  of  Theorem  1.5.1  and  hence  it 

Theorem  1.5.2.  lim  sup  r(-r,S*)  = 0,  where  5*  = (<s* 6*) 

n-+®  t€r  ' '•Ik 

minimax  rule  we  found  in  Theorem  1.3.3.  with  x^  replaced  by 

Proof:  sup  rft.A*)  < J sup  r^(T,6*). 

T^r  ~ i = l t€T  1 


Now, 

sup  r<’l(r.«;)  = ^/',e1-9o|<ALl(1-E0<t<tX1n)ldT(e1) 
+ ^|fli-eo|>A+eL2E0i^i^in^dT^i^ 


< L^.d  - inf  E [MX,  )])  + L?x;  sup  E„ 

|ere0|<A  9i  1 in  2 Vre0|>A+c  e 

lim  sup  r(r,<5*)  £ J lim  sup  r^(t,6?) 
n-«o  Ttr  ' ~ i=l  n-»  t€T  1 

<Lv\.(l-lim  inf  Ee  [«!(*,„)]) 
n-*®  |e1-e0|<A  { i 1 1n 


+ L?Xi  lim  sup  Eo  C^(X4n)]  * 0 • 

n-»  |ere0|>A+E  0i  1 tn 


Again,  when  eQ  is  unknown,  11m  sup  r(-r,<$°)  = 0 is  also  true. 

n-+®  ter 

Theorem  1.5.3.  When  o0  is  known,  the  r-minimax  rule 
•S*  s (A^t....6k)  is  admissible. 


Is  omitted. 
Is  the  r 

*1n* 


^6i(Xin^’ 


Definition  1.5.1.  If  x Is  a measure  on  ©,  and  If  6g  satisfies 

Inf  /d  L (0,6(x) )f(x| 0)dx(0)  = /0  L(0,6o(x))f(x|0)d-r(0)  < °°  , 

6 

then  6q  Is  said  to  be  a generalized  Bayes  rule  for  t. 

Remark:  6g  Is  a generalized  Bayes  rule  can't  guarantee  r(x,  6g)  < 

Lemma  1.5.2.  If  | L ( 0 , a ) | <M  for  some  constant  M,  then  6Q  Is  a generalized 
Bayes  rule->for  all  6,  / [R(0,<5)-R(0.eo)]dx(e)  >0. 
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Proof:  For  any  <5, 

/x/q[L(0,6(x))  - L(e,<50(x))]f(x|e)dT(e)dx  > 0 . 

But  since  L(9,(S(x))  - L ( 0 ,6Q( x) ) is  bounded  from  below,  by  Fublni's 
Theorem, 

|e/x[ L(0,6)-L(9,6o)]f(x|0)dxdT(0)  >0  , 

i.e., 

/q[R(0,6)  - R(0,6o)]dT(0)  > 0 . 

Definition  1 .5.2.  A generalized  Bayes  rule  6g  wrt  t is  unique  up  to 
equivalence  iff  for  any  rule  6, 

4 [R(0,6)  - R(0,<So)]dx(e)  = 0 

->  R(0,6)  = R(0,6O)  , v 0. 

Remark:  Let  6Q  be  a unique  generalized  Bayes  rule  according  to 
definition  1.5.2.,  then  if  6 is  any  other  generalized  Bayes  rule  for  t, 
we  have  R(9,6)  = R(9,6g)  for  all  9. 

Lemma  1.5.3.  If  L(9,a)  is  bounded  and  if  the  generalized  Bayes  rule  6g 
of  t is  unique  up  to  equivalence,  then  Sg  is  admissible. 

Proof:  Let  <5  be  such  that  R (0,6)  £ R( 0 ,5g)  then 

/0[R(e,fi)  - R(9,6g)]dT(0)  < 0 . 

By  Lemma  1 .5.2, 

/0[ R(0.«)  - R(0.«o)]<It(0)  - 0 . 

Now  by  the  uniqueness  of  <Sg,  we  get  R(9,<5)  * R(9*6q),  which  completes 
the  proof. 
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Theorem  1.5.4.  When  0Q  is  unknown,  the  r-minimax  rule 
6®  * (6^,. . . ,6®)  is  admissible  in  Dj . 


Proof:  Let  t = (Tg,u>0  ) be  the  measure  on  0 such  that  tg  is  Lebesque 
measure  on  @g,  and  with  0Q  given,  ,e2,. . . ,9k  are  independent, 
such  that 

V 5 eo*‘!eo’  ‘ PU(>  ■ V4l6o’ ' t 

eo  eo 

\ t0i  = 6o+A+eleo]  = % C0i  = 0o-A-£l0o]  = T 
eo  eo 

pWfl  t0i  = VA+f  ,0o]  = 1 - xi  - xi  « 

90 

then  for  any  6 = (6-j, ...,6^)  € D-j  , 

/0L(e,6(x))f(xle)dT(e) 


= ill  Too  2 Ll(1“6i(xO’xi))[feo+A(xi)+f0o-A^xi^feo(xo) 
+ T L 2 6i^xo’xi^fen+A+e^xi^+f0„-A-e^xi^f9„^xO^  d0O 


= il]  D Ll*-feo+A^xi^+f0Q-A-xi^f0g^xO^d0O 
k rx;  x.-xn+A+e  x.-xn-A-e 

+ l — ) + fo(“4r — )] 

i=l  101  list  <■  u ft  0 SZ. 


X.  x.-xn+A  x,-xn-A  1 

* fof-Lz5-)]  !• 
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Hence,  if  S^Xq.x^  is  defined  by  (1.4.6),  then 

6°  * (6®,..., 6^)  is  a generalized  Bayes  rule  wrt  t.  Now,  if 

/8  " R(6.«o)dl(®)  = 0 • 


i .e. , 


/x^[L(0,«(x))  - L(e,6°(x))]f(x|fl)dT(e)dx  = 0 


by  Fubini's  Theorem,  so 


v r r r , ,o  . x.-xn+A+c  x.-xft-( 

^li-J.3(X0’Xi)'61(x0‘Xi)]  T L2[f0(  ~ )+f0( ^T 


xi  x.-xn+A  X.-x.-A  ) 


Hence, 


lSi  (x0’xi ) ~ 


„ Xi‘xn+A*e  x<-xn-A-c 

,f  4 * (-Lfc-): 


X,-Xn+A  X.-Xn-A 

< i] 


,f 


Iqain , 


f (x0»xi ) i 2 ^ i ^ ^ ^ 


- x.-xn+A+c  x.-xn-A-e 
W-L-0 , + h_l_0 , 


x.-x+A  x.-xn-A 

’ L'X'U(^— 1 * 4'(^~)1  - (Vxo  * i V 


X 0 . 2 

P[x.-xn  = + tj  = 0 , since  (y1)  - N ((J)  , (°  2)), 
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So, 


6l(*rxo>  ■ ^(xj.Xq)  a.e.  . 


Then, 


R(e,<5)  = R(e,6u)  . 


,0 


By  Lemma  1.5.3.,  5 is  admissible.  This  proves  Theorem  1.5.4. 


When  6q  is  unknown,  we  have  restricted  the  decision  rules  to  the 
class  D-| . It  is  quite  natural  and  reasonable  for  us  to  do  this.  However, 
we  may  still  like  to  know: 

a.  Is  <5°  = (6®,..., 6®)  a r-minimax  decision  rule  in  D rather 
than  only  in  0^? 

b.  Is  6^  admissible  in  D? 

We  leave  these  questions  as  open  for  further  research. 


1.6  Relaxing  the  assumption  of  normality 

As  was  remarked  in  Section  1.3,  the  assumption  of  normality  is  some- 
what restrictive  to  our  problem.  In  this  section,  we  will  investigate 
some  more  general  distributions  for  which  r-minimax  rules  exist. 


Theorem  1.6.1.  Assume  0g  is  known.  Let 

A1  = {x1  lA1L2^fe0+A+^xi^+fe0-A-c^x1^  - X1Ll^feo+A^xi^+f0o-A^xi^}' 


Let 


9(6^  = E0^[IA^(Xi)]  where  X^-fg  (x).  If  g^-*^)  * 9(00-0^)  and 

g(0j)  is  decreasing  for  9^  > eQ,  then  6 * (6^,...,6|c)  is  a r-minimax 
rule  where  6j(x)  = IA  (x^)  for  1=1 .... ,k. 


SHI 


P, 

« I 


i' 


Proof:  Let  t*  be  defined  as  in  the  proof  of  Theorem  1.3.3.,  then  the 
Bayes  rule  for  the  1th-component  problem  wrt  t*  Is  given  by  6^(x^)  * 

IA. (xi ) for 

Now,  since  9(6^  +0q)  = 9 ( eo~e i ^ and  9(0j)  decreasing  for  > 6^,  so 

sup  9(9, ) = g(9n+A+e)  = g(0n-A-e) 

|ei-0ollA+e  1 0 0 

and 

inf  g(e . ) = g(6n+A)  = g(0n-A)  , 

|er0ol<A  1 0 0 

i.e.,  6^  satisfies  (1.3.1).  Hence  by  Theorem  1.3.1.,  6 is  a r-minimax 
rule. 

The  following  example  applies  Theorem  1.6.1.  to  select  some 
binomial  populations  with  large  entropy. 

Definition  1.6.1.  For  a binomial  distribution  b(n,p),  its  entropy  is 
defined  as 

l'(p)  = - [pfnp+(l-p)f«(l-p)]  . 

Note  that  ^(p)  is  associated  with  the  uncertainty  or  randomness  of  that 
population.  The  larger  the  4<(p).  the  stronqer  the  randomness. 

Example  1.6,1.:  Suppose  are  k independent  binomial  popu- 

lations with  ni  - b(n,p^).  We  define 

n1  is  positive  iff  ip(p^)  > 6 + e' 


and 


JT.  is  negative  iff  4»(p^)  < B . 
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Figure  2.  'Graph  of  iMp). 

% 

Equivalently,  we ‘can  say  that 
IIj  is  positive  1 ff  | p - - [ 

and  ni  is  negative  iff  |pi  - »j|  ^ A + e, 

where  ij/(»j  + A)  = B + e'  and  \\>(h  + A + e)  - Q. 

It  Is  seen  that  eQ  = h in  this  problem.  Let  r be  given  by  (1.2.2)  and 
the  loss  given  by  (1.2.1)  with  eQ  = h and  = p.,  then 


A1  * <xllVl[ph-A-e(x1)tpWMe(x1)]  iLlN[Va(x1)*|,i.-4(x1l]1 

where  P6(x)  = (£)ex(l-0)"'x  , we  find  t Af  Iff 


i \ * *.•  n-x.  a.  ii-a. 

2 * (*a-A-c)  (V»-A+e)  +(>a+A+e)  ^(^a-A-e)  - hfx  1 < 1 

£7x7  x4  n-x.  x.  n-x.  v i'  - 

1 1 (*j-A)  ' ( •s+A ) t(%+A)  ▼(■j-A)  1 

Also,  It  is  obvious  that  h(j  - x.)  = h(j  + x..) . After  some  messy  compu- 
tation, we  get 

x.  n-x.-l  x.  n-x.-l 

hJx^D-hfx^  = c(*j-A-e)  1 (*i+A+e)  1 (e(>j-A)  ’(V+A) 

t(££f  • ^)2Xl+1'n-1W2Afe)(W)X,(-,-4)n'Xr1 

2x1+l-n 


n-x. 


[(ilA  . iiA+£)  _i  ]}t 

lVa  >j-A-e;  JJ’ 


s 

i 
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then 


h(Vl)  > h(xi)  1ff  x1  > 


hfx^l)  = h(xi)  iff  x 


n-1 

i = — 


h(x.+l)  < h(xj)  iff  x.  < 


n-1 


Hence , 


A1  ■ (XilMx,)  i 1)  • («,  If-  «,  < x^f  + 

for  some  Integer  mf.  It  follows  that 

5(Pi’  ' VV*f  n * P[?  ' "I  1 *)  1 ? * "i1 


?*  "t 


v , n xi  n_x< 

£ (x  ) Pi  O-Pj)  if  n is 
H _ Xi  1 1 


even 


xi  = 2 " mi 


n-1 

~y~  + m. 

v 1 , n,  xi  n"xi 

t-  (x  ) pi  H-Pj)  if  n is  odd  . 

_ n+1  _ i 1 

_ ^ 


xi=~r  - m 


% 


= g(i-P1)  , 

i.e.,  g(^pf)  ■ g(*j-  pf).  Since 

(xV  P,XtO-p/-X*  - (l-pf)n  («)  eX^(A> 

pi 

and  61  is  1ncreasing  in  pif  so  by  Corollary  1.3.1,  g(Pi)  is 

decreasing  for  p..  > s and  increasing  for  p.  < ij. 

Now,  by  Theorem  1.6.1.,  6 * (6. 6 ) with  6,(xJ  * I (Xj) 

' 1 K » « rn  m n j.  „ i i 

*■?“  mi  *?  + mi  J 


is  a f-minimax  rule. 
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Definition  1.6.?,  X * f0(x)  Is  said  to  be  a PF  density  (polya 
frequency)  If  fQ(x)  * f(x-e)  Is  TP. 

Theorem  1.6.2.  (Karlin)  If  X ~ f Q( x)  Is  a PF  density  and  f(x)  * f(-x), 
then  | X | Is  TP  (hence  TP,,)  for  e > 0. 

Remark:  The  density  of  |X|  Is  [fp(x)+fe(-x)]  Ij-Q  ^ ( x ) , so  by  Theorem 
1.6.2,  we  can  assert  that 
f(x-e2)  + f(x+e2) 

f (x-9j ) + f(x*e^)  (1.6.1) 

is  Increasing  In  x for  x > 0 and  e2  > . 


Theorem  1 .6.3.  If  X^  has  a PF  density  f0  (x)  * f(x-G^)  and  f(x)*f(-x), 
then  the  assumptions  of  Theorem  1.6.1  are  satisfied. 


Proof:  We  need  to  show  <j(q+0g)  a g(0g-0^)  and  g Is  decreasing  for 
> 6g.  Let  y^  * x^-0g,  then 

, L?X;  f(y,+A+e)  + f(yrA-c) 

Ai  ’ {yi+eol  qq  fTy'^AT  v TTy^-lj i 1}  • 

Let 

f(y<+A+c)  + f(y.-A-e) 

h(yt'  ' ” fry,w*  f(y,-s)  ’ 

then 

f ( x)  - f(-x)  ~>  h(yj)  - hf-y^  . 

Also,  from  (1.6.1),  h Is  Increasing  In  y^  for  y^  > 0 , 
so  Aj  - (yi+0o|-t1  < y^  < tj).  Then 

9(6^  • e 11^)3  - P[-t^0o  < 7+e,  < t1^e03  , 


where  Z - f(>  tg(x).  Since  Z ~ -Z,  hence 

g(<VV  * g(0o-ei)  • 

Now,  by  the  remark  of  Corollary  1.3.1.,  we  get  g Is  decreasing  In 
for  0.  > Oq.  This  completes  the  proof. 

c,  'c(|xret| 

Example  1.6.2.:  If  fft  (x)  » e for  1*1,2,... ,k  , 

where  c^'s  are  known  constants,  then  the  r-mlnlmax  rule  Is 
a = (vS, ,.. . ,M  with 

- 1 _ I e\  a I 


W ■ 


1 If 


...  r,-ci,VV"l  . -c,l«,-v4t'l 

le  + e J 

~ -c1|,)-e0-4|  -c1|x1-o0*A| 

a.l,  Le  + e J 


Proof:  The  result  follows  directly  from  Theorem  1.6.3. 


1.7  Bayes  rule  and  the  minimax  rule  for  selecting  populations  close 
to  a control 

In  Section  1.3,  we  assumed  that  partial  prior  Informations  about  s 
are  known  and  that  they  are  summarized  In  the  class  r.  In  this  section, 
we  will  consider  two  extreme  cases,  namely,  either  complete  Information 
or  no  Information  about  « Is  known.  Correspondingly,  we  are  looking 
for  the  Bayes  rule  or  minimax  rule.  The  problem  will  be  treated  under 
the  assumption  that  Og  is  unknown. 

The  following  lemma  may  have  been  used  by  many  people  Implicitly, 
but  it  Is  worth  stating  It  out  explicitly. 

Lemma  1.7.1.  If  (t$,D,L)  Is  a decision  problem  and  If  for  any  <ScO, 

k ip 

A = (A.,...,\)  and  L ( o , vS ) * T L'  '(0,6.)  [1 . e. , the  loss  Is  additive], 

IK  1-1  1 
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then  for  any  prior  distribution  t on  8,  6°  is  a Bayes  rule  of  r if  5? 
Is  a Bayes  rule  of  t for  the  1bb  component  problem. 

Proof:  r(T ,6)  « £ r{1)(T.6j  > [ r(i)(T,«?)  = r(x,6°). 

1»1  1«1  1 

Let  us  assume  8^  ~ r^(8)  - N(oij,82)  and  0^'s  are  independent 
(0<1<k),  then  since  Xf|ei  - N(ero2),  we  get 

+ XjB?  o2  02 

Vxi  - N ( r ' 2"  * T‘  = N(a1‘  bi>  • 

o + <T  + b‘  11 

The  Bayes  rule  of  the  1th  component  problem  Is  to  minimize 

/xfL1(i’<si(?))^  |01-0o|<AdT(®l?)  + L26i^)/|ei-e0|>A+edT(e|x)Mx)dx 

(1.7.1) 


where 


and 


n,(;)  ■ v0y*^,<v  ■ ty”vx<,dT<<e<’  ■ 

„ , , , ,V9t(X,><lT<(<’'>  “ S(X1)dT1(e)>  k 

dT(?|x-> m tI0-Vp — -tVT'<9',Xi)  • 


Hence,  (1.7.1)  reduces  to 


L>Vi6t-e0i<4dT<9J!)’";)'1!;*/xa2/|er6oliWcd,(e0ix0)d,(6,ixt: 


Ll/|e<-0nl<AdT(eolxO)dT(0llx1)^l(x)  n dT(0JxJm(x)dx 
1 o • j/O  J J ' ' 

JfH 


So  the  Beyes  rule  Is 


61 (xo,x1 > 


l if  L2P[|ere0!>AH|x0.xi]  < L1P[|ere0(<&|x0,x1] 
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2 . .2, 


But  9^"0olxo*x1  ' N(ai-ao»  bi  + b(V’  tbus  we  bave 


61 (x0*xi ) 


i if  l2[i-K 


A+e+an-a<  -A-e  + aft-a. 

i-  ■ ° -1)  * 

y^r  j bo*bi 


A+a„-a.  -A+an-a. 

< L,[>(— 5-1)  - ♦( 5_i)  ] 

- I n — 9—  / "T  1 


,b0+b1 


0 > 


LoD-^yt+A'+e'  )+*(yj-A'-e' )] 

1 if  h(y.)  * 2 1 I 1 

♦(y^A* )- «<y1-A' ) 


where 


i if  ly^i*?  P if  larao>^?  “ s?ibo+bi 


(1.7.2) 


Vai 


■- — , and  e'  ■ — - 


(Vbt 


The  last  equality  holds  because  h(y^)  * h(-y^)  and  the  fact  that  h(y^) 
is  increasing  for  y^  > 0.  We  have 


7 

Theorem  1.7.1 . Assume  has  Independent  prior  distribution  N(a^.B^). 
1*0,1,.  ..,k,  then  the  Bayes  rule  is  SB  ■ (<SB,...,6B)  with  6B  defined  by 
(1.7.2). 


Vi; 

mum 


is  the  Bayes  rule  for  1 component  problem,  since  L 


Hence  Lenina  1.7.1  asserts  that  <5 


The  following  lemma  is  essential  when  we  search  for  minimax  rules 


Lenina  1.7.2 


Proof 


is  increasing  in  x 


But  when  x 


hence 


f(x)dx  - F(-t-4)-F(-t-^) 


(11)  Similar  to  (1) 
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Theorem  1.7.2.  Let  A^a  and  A'«l-a.  Also,  let  6 * (6j 6fc), 

where  ^(x^)  * I[t1  ,ti ](xrxo} ’ be  the  corresponding  r-minimax  rule 
in  D^.  If  a is  chosen  so  that 


-t.+A 

L] [#(-  1 


/ To 


) + T>( 


-t^A 
t'T  a 


t.+A+e 

■)]  • l2MJ ) 

/?  a 


then  A is  a minimax  rule. 

Proof:  For  e € qq(  i ) , 

R(()(f.«f)  - L,  P[|x,-x0!  > t(|o0.ef] 


*(■ 


-t>A+c 

/f  o 


-)]  . 
(1.7.3) 


- L,  [*(■ 


+ ,(:v<vv 


/To 


STo 


)] 


r -t,-A  -t.+A 

< L O(-L-)  + *(_L_)  3 

JT  a /2  a 


* 


by  Lenina  1.7.2(i).  For  e €QD(i)  , 

*■  D 


R(i)(e,<v  = l [♦(^ir.(9<"9o)) . f(:tr(flreo))3 

JT  o rto 


t,+A+c  -t.+A+e 

< i [♦(  J ) . «( — ! 

JT  a /To 


)] 


by  Lemma  1.7.2(ii).  And  for  etf  iJiJUgJi)  , 

^ D h 

R{i)  (e.^)  = o . 

But  in  the  proof  of  Theorem  1.4.1,  we  have  shown 
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lim  Inf  r(i)(xn,6jn)  > L^O-E^A^)]  + L2(]-a)EA+£ [5^ (Y.)] 


-tj+A  -tj-A  t.+A+e 

L^W-r- ) + *(— r — )]  + L>(l-a)0(^ ) 

' S?  a fl  a 2 y/T  a 

t.+A+e  -tj-A 

L,[  (4 ) + *(4— )3 


t.+A+e  -t.+A+e 
Lot  (4 ) - *(-z — >] 


t.+A+e  -t4+A+e 

; \ A/  * i 


> Sup  R^(0,6.) 
0 


Hence , 


r(T"’S")  1 J,  H:inf  r<')(Tn-S?r,> 
n-*»  - i = i n-*» 

k k 

> l sup  r(4mJ  > sup  [ 
i=l  0 ~ 1 0 1=1  ' 1 


= sup  R(0,6)  . 

6 

This  proves  6 is  a minimax  rule. 


One  may  wonder  the  existence  of  such  ana(0<a<l),  so  that  (1.7.3) 
holds.  We  will  show  that  they  do  actually  exist.  We  know  t^'s  are  the 
positive  roots  of  the  equation 


ha(x) 


L (1-a)  " ~~ ^2"  (2A+e)e  cosh(— ^ (A+e)x) 


cosh( — Ax) 
2 a 


- 1 * 0. 


.5 


Figure  3.  Graph  of  h (x)  for  a = .1,  .5  and  .9. 


One  can  see  ha(x)  is  decreasing  in  a for  fixed  x,  this  implies  is 
increasing  in  a.  And  wher,  a 1 , we  have  -*■  so 


-t.+A  -t.-A 

L1  — ) + ^ -*  0 
& O o 


t.+A+e  -t.+A+e 

Wi—'i  - *<4— » * l2 


On  the  other  hand,  when  a -*•  0,  h (x)-l  is  positive  for  all  x,  so  there 

Q 


exists  some  a q such  that  t^  » 0,  then 


-t.+A  -t.-A  . . 

) + *(—!_)]  = Li  [♦(•—= — ) + *(-—)]  = L, 


It  is  clear  that 


-t.+A 

t 1 


L [♦(_!_)  + *(—!—)]  - U|>(— ! ) - *( 


t.+A+e 


- 1 j +A+e 


is  continuous  in  a,  hence  there  exists  a*  ( a^<  a*  < 1)  such  that 
(1.7.3)  is  true. 


■tr 


4p 


In  the  next  section,  a*  will  be  found  for  some  selected  values 
of  A and  e.  (1^  * Lp  = 1 ) . 


1.8  Comparison  among  Bayes,  r-minimax  and  minimax  rules 

When  we  face  a decision  problem,  the  prior  information  has  a very 
Important  influence  on  our  choices  of  the  optimal  rules.  In  general, 
one  would  use  the  Bayes  rule  if  the  prior  distribution  is  known,  use 
the  r-minimax  rule  for  incomplete  prior  information,  and  use  the  minimax 
rule  when  no  prior  information  is  available.  The  comparison  of  these 
rules  will  give  us  some  idea  about  how  far  our  decision  is  from  the 
real  optimal  rule  if  the  prior  information  we  have  is  incorrect.  In 
other  words,  we  are  interested  in  the  robustness  of  each  rule. 


In  this  section,  we  make  a thorough  comparison  among  these  rules 
In  terms  of  Bayes  risk,  the  maximum  risk  over  r,  and  the  overall  maximum 
risk.  Because  the  loss  is  assumed  to  be  additive,  the  comparison  is 
made  for  the  1st  component  problem  only.  Sub-index  i will  be  omitted 
from  the  notation  and  AR(x)  = I^_t  t -jU^-a^,  6g(x)  * I^_t  t ](xi‘xo)’ 

and  6m(x)  = Ij- t -j(x^-Xq)  mean  Bayes  rule,  r-minimax  rule,  and 
the  minimax  rule,  respectively.  It  is  also  understood  that 

dTB^0’6l^  = dT0^0^dTl^l^*  where  Ti^ei^  ~ the  P^ior  and 


ajo  + x1e1 

5 5 

o + e: 


for  1*0,1.  Also,  x^  * PT  [ I e1 -eQ|  < A], 

B 


* PT  C I eTeo!  - A+e^*  ? = ^xo,xi^’  and  0 * (0o,0i^ 
8 


will  be  used  In  this  section.  Now,  the  Bayes  risk  of  the  Bayes  rule  Is 


-I 


3 
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r<TB*V 

= L1P[|a1-a0|  > tR,  |0,-eo|  < AT 

+ L2PC!ara0l  < tB,  |er0o|  > A+e]  . 


Since 

1 % 

1 ? ? o \ \ 

X,1 

~ N 

i 

o *6,  8* 

h 

W ' 

i 6?  6f// 

/ 

for  i = 0,1,  then 


(ara°K  n f°r<M  fc°o  + wi 
VerV  >°rV  ' 2 , 2 


2 2 2 
w0  + W1 


2 2 2 ' 
U)0  + U)1  U)0  + Wl 


2 01 

where  w.  = *■  , for  1 = 0,1.  Let  d * a.-an  , 

ac  + &.  10 

l 


u?  = Sq  + v?  = u)q  + and  p = ^ , 


then 


\z2  / 


arVd 

erVd 


- N 


■ (’  °) 
vo  r 


Hence, 


tB-d 


■vd 


r<VV  = LlPC(zl  ' "T"  or  VHH  and 


A-d, 


,"tR_d  ti»“d  -A-c-d  A+e-d 

* 4pl(— iZ,  5-V)  and  (— □ i z2  or  Z2  > — jj— ) ] 


-tn+d  A-d  -t.+d  -A-d 

■ L,CF(-f-  . — ; -p)  - F(— S—  . — ; - p) 


-tR-d  A-d  -tR-d  -A-d 

+F(~-  . — ; p)  - F o)3 


V * u 
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_ ,Vd  -A-e-d  -t„-d  -A-e-d 

* L2[F<  v • S-  » »)  -F<— T"  • —3—  s 0) 

tR-d  -A-e+d  -tD-d  -A-e+d 

* f(-t-  • -5- 1 - F<~ r~  • -3—  '■ 

where 

F< x0‘y0;  p)  3 pC21  < x0,  Z2  < yQ]  . 

Similarly,  we  can  compute 

= ^PCIXj-XqI  — ^1  ”^0  ^ 

+ L1P[|xrx0|  > tG,  |©re0|<  A]  . 

Now, 

2 2 2 2 2 3 

o + 6*  + o4  + 8T  6n  + 8; 


0 

X 

1 

X 

/ 

-Nl 

“rao / ’ 

Jo 

2 + 2 

6o  *5 


a*-  + 
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•tg+d  A-d 


r(TB.5fi)  = Lj[F(  r ♦ u '»  -o')  - F(  r 


tfi+d  -A-d 


V) 


-tR-d  A-d  -tft-d  -A-d 

+ p(-7-.  — ; p1)  - F(-f-,  — ip1)] 


tg-d  -A-e+d 


+ l2[  h~  , — , -p1)  - n 


■tg-d  -A-e+d 


; -p1) 


tr-d  -A-e-d 

♦ F (-1-  , 

' r u 


-tr-d  -A-e-d 

i »■ ) - F(-S- . — — ;o'))  , 


where  p*  * £ . Since  <SR  and  6^  have  the  same  form  except  for  the 
constant  tg  and  tM,  so  when  we  replace  tg  by  tM  in  the  above  formula, 
we  get  r (tb,«m). 


I 1 1 


The  next  thing  we  are  going  to  compute  Is 

sup  r^(T,6),  for  6 » <SR,  &r  and  6M. 

ter  8 fi  M 


r ' VT.6)  = / 


lei _eol— aLi^1E0(6^~^  ^ 3dt(o) 


+ ^|0reo!>^W^?)^(e) 


(1.8.1) 


Lemma  1.8.1. 

sup  r(1)(T,S)  = L-j ( 1 - Inf  E [6(X)])X,  + L?  sup  E_[6(X)]xr 

Ttr  |ere0|<A  “ ~ 1 2 |0r0o|>A+e  ? ~ 1 

Proof:  < is  trivial.  To  prove  the  other  inequality,  let  us  consider 
two  sequence  {0n)n“  and  (6'}^  such  that  0p  (1)  and  e'  fceB(l), 

and  E [6(X)]  - inf  E [6(X)]  and  Ett.[6(X)]  - sup  E_[6(X)]  . 


(ere0(<A  ! 


~n  ~ f0^9ollA+e  H 


If  we  define  xntr  as  Pt  [0=enJ  * X1 , P^  [e=  0']  = Xj” 


and  P^  [o  £ 8 R(1 ) U <a>B(l) J = l-XrXf.  then  we  have 


1 im  r^  (t  ,5)  = L,(l-  inf  E [fi(X)]X.  + L?  sup  Efl[6(X)]X{ 
n-  1 |ere0|<A  8 ~ 1 2 |0r0o|>A+c  8 ~ 1 

Now,  sup  r^(x,6)  > r^(t  ,6)  for  all  n,  and  if  we  take  11m  , 
r€T  n n-»<» 

we  get  the  result. 


From  the  above  lemma  and  the  proof  of  Theorem  1.4.1,  we  get 
*S  '•‘’’('•V  • * L,»,-E„  rs,(Y)]  . 


where  Y * X^Xq  ~ 2 ac).  So, 

sup  r(1,(x.6G)  « L1xip[|xrxo|>trj0roo-A] 


+ ‘-2X{P[|X1-Xo|<tG|Or0o=A+e] 


"VA 


r i,  "V,+A  tr-A-e  -tr-A-e 

. L^X'j c*(~~  ) + ) - $( — )] 

72  a ^ a 71  a 


Hence  we  also  get 

/ i ] -tM-A  -tM+A 

sup  ru;(i.AM)  « L. Xi[<&( — - — ) + «P( — ^ — )3 
t€T  71  a 71  o 


tM-A-e  -tu-A-e 

♦ L2XfC<**(-^= ) - ♦(-£ )] 

71  a 71  a 


"VA  -tM+A 

■ (Xi+Aj  L, [*(-—-)  + ♦(-=-)]  • 

' * ' 71a  71a 


.(!)< 


To  find  sup  r'  ;(t,(5r)  will  need  some  more  work.  First  note  that 

■tfr 


ara0,eTe0  ~ )»  where 


si  ei 

u = — ? 


Bn  e 


2 2 
a,  a oiq  cj 


p0  n0  . , “1  ° a0  ° % 
T~T  {JT73  " 7T~T) 


Bf  + <T  Bq  + a"  B^  + ol  Bq  + o‘ 


and 


2 B?  a2  B04  o2 


* ^ 9"1>  + 5T"o  • 

(bJ+o2)-  (Bq  + /)2 


Then  let 

9(u)  • Eft[6R(X)]  * E,r-tn  < a,-an  < t0] 


we  find  g(u)  - q(-w)  and  q(u)  Is  decreasing  In  |vj|,  by  Lenina  1.7.2(11) 
Now  let  us  consider  the  following  two  cases. 

62  82 

(a)  If  ^ fi  B2,  then  -g-  — ? • And  if  we  let 

01  + o ' 0Q  + O 

°1  = 0q  * 1 °°»  we  have  | p | -*■*>.  So  we  get 

Inf  E [6_(X)J  * lim  g(p)  = n.  Also,  when 

|ere0|<A  5 8 - (mIhkx, 

t Pq  ’ f^|l|=0)  ^ (0|  ( 0-j - 0q ( 2.  A+e}  f $ . 


Figure  4.  Graph  of  u = 0 on  0^ -plane. 

Hence , 

sup  E [6  (X)]  * g(0)  * ♦(-!)  - . 

Iei"®0l— A+e  ~ c c 

By  Lemma  1.8.1., 

sup  r(1)(r,6B)  « L, X,  + L2X7 [*(■—)  - *(—•)]  . 


(1.8.2) 
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(b)  If  * 6q  = e2  , then  u = -y  6 ? (0r0Q)  + y°  - . 

0+o  1 u ft*  + o6 

(1)  If  o1  > a0,  then  under  |e1-eQ|<A.  |p|  has  Its  largest  value 

02A+a2(ot..-aft) 

when  0]  - 0q=A.  In  this  case,  let  p.  * — * J ■ 0 , so 

1 ft*  + O* 

|9;S0V!C5°1?,J  ■ • 

If  a1  1 ao’  under  ^ 0 1 ~ eo I — A * ll'l  has  its  lanJest  value  if  0^ -0q=  -A,  then 


where 


02A+o2(ag-a.| ) 

~t  7 2 

0+o 


So  we  get 


^ ^ • 


where 


A02+o2|a1 -a0| 


0 + a 


(ii)  To  find  sup  Eflr6_(X)],  let  us  note  that  u*0  iff 

|0r0ol>A+e  2 B - 


(oiq-c^)  = ei"eo-  Then 


(1)  If  ^“o-01!  llA+e,  then  {p=0}0{  le^BpI lA+e}  i «j>  . 


, . Ee[VX>J  " - ♦(^  • 

|0r0ol>A+e  2 8 - 5 * 
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(2)  If  ^laQ-a^  |<A+e,  then  under  I^-BqIiA+c,  |u|  has  the 
02(-A-e)+o2|a1-ao| 


smallest  value  when  u=u 
then 


-x w- 

r + oc 

tD-u, 


2 ? 2 

To  sum  up,  If  81  = * 8 , then 


L1X1^  C 
1 


"VM0x  . /"V^Om  a , r^Bx  Af_t Bx 


L)  + (_JL_ii)]  + L2[*(-^)  - ♦(-£)] 


1f  t'"o-01i  I-a+e 
b 


JD, 


;§ r (T-v  - ^ 


W -tp-u' 


+ ♦(limo)] + l2[$(^L)  - «•(— |— )] 


1f  T^V01]  l<A+e 

0 


At  last,  let^*  ■ (t|t  Is  a distribution  onQ},  we  want  to  compute 
sup  r^(T,6  ),  sup  r^(t,<Sr)  and  sup  r^(T,<5M)  . 

itfti  * t60*  1 T^i* 


Lemma  1.8.2. 

sup  r^1}(T,6)  » max[L,(l-  Inf  E[6(X)]},L2  sup  E [6(X)]] 

le1-eQ| <a  ~ ~ |e-,-e0|>A+E~  ~ 


Proof:  From  (1.8.1),  for  all  t£q * 

r(1,(T,«)  < L^l-  Inf  EeL6(X)])PT[|ere0|<A] 

I ei “eo  A ~ 

+ L2  sup  E0[6(X)]PT[|01-eo|>A+c] 

lVeol-A+e  - 
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± max  (Li(l-  Inf  E[A(X)J),  L sup  E[6(X)J). 


|oro0|<A  ! 


|0r0ol>A+e 


Now,  let  0n  and  0'  be  the  same  as  In  Lemma  1.8.1,  but  we  let  t be 


such  that 


Pt J>?n]  = 1 if  Li(1'Eo  1UE  ,[6(X)] 

n ~n  " Zn 

PTnC!’?n]  ■ 1 ,f  l-jd-E,  [«<;>]  <L2Ee.[6(X)]  , 


r <VlS)  = max(L1(l-E0  [6(X)]).  L?Ep.[6(X)]) 


- maxa^l-  inf  E el«(X)L  L,  sup  E [<S(X)]) 
lflreoliA ' IVeol-A+e  ' - 


This  finishes  the  proof. 

From  Theorem  1.7.2,  we  get 

(11,  „ -tM-A  -tM+A 

es*  r <T'i«)  * 


r , ^M*A"e  -tu-A-e 

= L-,Od ) . ♦(-? )]  . 

o ►?  O 


From  Lemma  1.8.2.,  it  Is  obvious  that 

(11.  -tr-A  -t-+A 

SUP  r'  ’h,se)  ■ max(L, [♦(-£-)  , *f _5— ) J , 

Jl  a SZ  a 


And  If  * P*  . then 


. tfi-A-e  -tr-A-e 
l2[*(-£_)  . *(_| — )]). 

O O 


sup^r(T,fiB)«max(L1,  L2[*(i)  - *(~))  . 


(1.8.3) 
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If  Bq  = 8^  a B2,  then 


’V^ 


"Vu0, 


sup  r( t,Ar)  =■ 
xte* 


max^lX-^)  + *(-^)].  L2[*(-f)  - *(-f)]) 

2 

1f  T l«o'“l  I - A+e 
8 

P-  tp-p  “ tn~ U 

i*'ax(L^[4>( — | — -)  + *(-—■)].  L2[*(-|-) -♦(-!-)]) 


i f ^ | 1 < A+e  . 

8 


Remark:  All  the  risk  computed  in  this  section  are  based  on  one  sample 

from  each  population.  If  we  have  n samples  from  each  population,  by 

2 a2 

reducing  to  sufficient  statistic,  we  only  need  to  change  cr  to  p-  , and 
all  the  formulas  will  remain  valid.  The  formulas  are  used  to  compute 
the  following  tables. 


Illustration  of  the  table: 

(1)  The  control  parameter  PQ  is  assumed  to  have  prior  distribution  as 

2 2 

N(0,1),  and  is  assumed  to  be  distributed  as  N(a,3  ),  where  (a, 3 ) 

are  chosen  as  (1,1),  (0,.5),(0,1)  and  (0,2)  in  the  tables. 

2 

(2)  —are  chosen  as  .2  in  Table  I and  as  .5  in  Table  II. 

(3)  A are  chosen  as  .5,  1.,  and  1.5. 

(4)  For  A= .5 , e are  chosen  as  .2  and  .4. 

For  A=1 . , e are  chosen  as  .3  and  .8. 

For  a=1.5,  e are  chosen  as  .5  and  1. 

2 2 

(5)  When  (a, 8 1 A and  e are  fixed,  X and  X'  are  computed  so  that 
Tg€r.  tg,  tfi  and  tM  are  found,  and  r(tg,<$),  sup  r(-r,6),  and 

sup  r(t,(S)  for  6 = 6a,  6r,  <5U  are  computed. 

*6*  B fi  M 


They  are  arranged  in  the  following  manner: 


(6)  All  tables  are  computed  under  the  assumption  that  . 

To  use  the  table: 

2 2 

the  Bayes  rule:  For  specified  values  of  , (a,B  ),  A and  c, 

look  for  tR  and  the  risks  in  the  first  row  of  each  block. 

2 

the  r-minimax  rule:  For  specified  values  of  2_,  A,  e,  X and 
X',  look  for  tfi  and  the  risks  In  the  second  row  of 
each  block. 

2 

the  minimax  rule:  For  specified  values  of  A and  e , look 

n 

for  the  tN1  and  the  risks  in  the  third  row  of  each  block. 

1 


(a)  For 

(b)  For 

(c)  For 
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Table  1.1  1.  The  first  column  lists  values  of  tR,  tfi,  and  t^. 

2.  The  second  block  of  numbers  are  values  of  r(TR,5), 

sup  r(x,6)  and  sup  r(x  ,<5 ) corresponding  to  6*6D,6r  and  6M 
t€T  8 G M 

3.  The  entries  in  the  third  block  are  values  of  ratios  of  the 
risks  in  the  second  block  (dividing  each  column  by  the 
diagonal  value). 

o2/n  = .2  , (a,  B2)  = (1,1) 


e = -2, 

X = .2174 

X'  = 

.6987 

.5657 

.1529 

.5201 

.5802 

2.3924 

1.2991 

0. 

.2174 

.2174 

1.0 

1.4216 

2.2391 

.6421 

.1598 

.4091 

.4466 

1.0450 

1.8818 

bd 

X = .2174 

X"  = 

.6177 

.6820 

.1193 

.4440 

.5661 

2.0424 

1.4642 

0. 

.2174 

.2174 

1.0 

1 .8224 

2.5865 

.7261 

.1260 

.3229 

.3866 

1.0562 

1.4851 

c ~ .3, 

X * .4214 

X'  = 

.4679 

1.1499 

.1224 

.4775 

.6709 

1.3280 

1.6509 

1.0166 

.1420 

.3595 

.4902 

1.1593 

1.2063 

1 .1503 

.1312 

.3614 

.4064 

1.0711 

1.0052 

c = .8, 

X = .4214 

X'  = 

.3097 

1.4000 

.2648 

.5503 

1.4065 

2.0878 

1.5542 

.1883 

.3488 

1.0593 

1 . 3232 

1.4001 

.0649 

.1927 

.2636 

1.1297 

1.0235 

c * .5, 

X = .5996 

X'  = 

.2567 

.3332 

.6824 

1.4156 

1.9704 

2.4287 

.0999 

.2354 

.7511 

1 . 3594 

2.1687 

1.7500 

.0827 

.2966 

.3463 

1.1253 

1.2600 

e * 1.0, 

X * .5996 

X"  = 

.1511 

2.0000 

.5628 

1.5135 

2.6227 

2.5514 

1 d 

.5324 

1.2552 

2.4808 

2.0000 

.1611 

.2146 

1.2445 

1.4733 

4 

i 
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Table  I (Continued)  , Table  1.2 
o2/n  - .2,  (a,p2)  = (0,  .5) 


wm 

X * .3169 

X'  - 

.5676 

.5731 

.2056 

.7466  1.0 

2.3559 

2.2391 

0. 

.3169 

.3169  1.0 

1.5414 

2.2391 

.6421 

.2097 

.3950  .4466 

1.0199 

1.2465 

A = .5 

e * .4, 

X = .3169 

X'  = 

.4624 

.6861 

■SI 

.7043  1.0 

2.4115 

2.5865 

.4766 

.2921  .5760 

1.2540 

1 . 48<>9 

.7261 

.3013  .3866 

1.0464 

1.0316 

e = .3, 

X = .5858 

X'  « 

.2885 

1 

■a 

.8687  1.0 

3.0719 

2.4606 

. 1 

Ka 

.2828  .8955 

1.3802 

2.2034 

1 E 

. 1 51 2 

.3553  .4064 

1.1608 

1.2564 

A * 1. 

e * .8, 

X = .5858 

X"  = 

.1416 

1 . 4000 

.7268  1.0 

6.0236 

3.7939 

2.1098 

.1207  .6879 

1.2167 

2.6098 

1.4001 

.0708 

.1917  .2636 

1.3920 

1 . 5891 

e * .5, 

X = .7793 

X'  = 

.1025 

1 

EH 

.8818  1.0 

8.6338 

2.8875 

.1021  .9850 

1 .6986 

2.8443 

.3054  .3463 

1.4732 

2.9901 

e * 1.0,  X * .7793  X'  = .0412 


■n 

.8206 

21.3210 

4.6599 

3.1757 

.0385 

1 

1.5578 

3.9951 

n 

.1761 

u 

2.0717  4.5754 

Table  I (Continued),  Table  I<3 
a2/n  « .2,  (a,  B2)  - (0,1) 


e ■ .2, 

A • .2763 

A'  - 

.6206 

.5657 

.4721 

1 .4800 

1.0570 

0. 

1.0 

1.4525 

2.2391 

.6421 

lImtM 

1.0016 

1.4496 

DD 

A = .2763 

A'  « 

.5245 

.6820 

.1459 

.3237 

.4454 

1.1791 

1.1520 

.1873 

.2297 

.2745 

.8281 

1.5743 

2.1419 

.7261 

.1479 

.3096 

.3866 

1.0132 

1.1279 

e * .3, 

A * .5205 

A'  • 

.3580 

HI 

f 

.3397 

.5503 

1.0190 

1 . 3540 

.3333 

.7097 

1.0928 

1.7463 

1.1503 

.3570 

.4064 

1.0818 

1.0711 

e * .8, 

A - .5205 

A'  ■ 

.2031 

ESI 

.1597 

.4247 

1.0341 

1.6114 

K21 

.1.545 

.5444 

1.0743 

2.0656 

.0690 

.1907 

.2636 

1.1979 

1.2347 

EH 

A - .7112 

A'  • 

.1573 

.0647 

.5628 

1.3610 

1.6252 

1 ! 

.1546 

.9349 

1 .6478 

2.6995 

RBI 

.3008 

.3463 

1.2433 

1.9453 

BBZ 

1 

A'  ■ 

.0771 

2.0 

.0887 

.4372 

1 . 3378 

2.0372 

2.8887 

.0663 

.7306 

1.4276 

3.4045 

2.0 

ESSI 

.1692 

.2146 

1.5211 

2.5505 

t 


G3 


Table  I (Continued),  Table  1.4 
o2/n  = .2,  (ct,62)  « (0,  2) 


e = .2, 

X = .2272 

X'  = 

.6861 

.5605 

.1627 

1.0 

3.0855 

2.2391 

.2272 

.2272 

1.3966 

2.2391 

.6421 

.1638 

.4079 

.4466 

1.0072 

1 . 7954 

e = .4, 

X = .2272 

X'  = 

.6033 

.6791 

.1271 

.6989 

1.0 

3.0764 

2.5865 

0. 

.2272 

.2272 

1.0 

1.7876 

2.5865 

.7261 

.1286 

.3211 

.3866 

1.0120 

1.4134 

e = .3, 

X = .4363 

X'  = 

.4529 

1.1499 

.1260 

;■ 

2.4156 

2.4606 

1.1044 

.1350 

1.0719 

1 .0700 

1.1503 

.1318 

1.0459 

1.0006 

£ = .8, 

K 

X"  = 

.2986 

1.4000 

.0588 

.7317 

1.0 

3.9116 

3.7939 

1.5896 

.0597 

.1871 

.3697 

1.0138 

1.4026 

1.4001 

.0648 

.1937 

.2636 

1.1017 

1.0357 

e = .5, 

X = .6135 

X'  = 

.2482 

1 . 7500 

.0725 

.8614 

3.7480 

2.8875 

2.4739 

.0982 

.2298 

•1 

1.3543 

2.2326 

1.7500 

.0810 

.2984 

1.1174 

1 .2985 

e = 1.0 

, X = .6135 

X'  = 

.1489 

2.0 

.0304 

■WjB 

7.0051 

4.6599 

2.5663 

.0375 

1.2320 

2.5246 

2.0 

.0376 

.1636 

1.2364 

1 .5034 
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Table  1 1 - 1 1.  The  first  column  list  values  of  tg,  tR  and  t^. 

2.  The  second  block  of  numbers  are  values  of  r(tg,6), 

sup  r(T,S)  and  sup  r(i,6)  corresponding  to  6=<$D,<5r  and  tfu. 

T T T * B G M 

3.  The  entries  in  the  third  block  are  values  of  ratios  of  the 
risks  in  the  second  block  (dividing  each  column  by  the 
diagonal  value). 

a2/ n = .5,  (a,  B2)  = (1,  1) 


e = .2, 

X = .2174 

X'  = 

.6987 

.3275 

.2104 

.4242 

.7625 

1.9511 

1.6022 

0. 

.2174 

.2174 

1.0 

1 .0334 

2.1013 

.8054 

.2382 

.4360 

.4759 

1.1321 

2.0051 

5 

c = .4, 

X = .2174 

X'  = 

.6177 

.5571 

.1848 

.4761 

.5985 

2.1900 

1.3432 

0. 

.2174 

.2174 

1.0 

1.1767 

2.2444 

.8615 

.2038 

.3721 

.4456 

1.1030 

1.7115 

e = .3, 

X = .4214 

X'  = 

.4679 

1.1441 

.1951 

.5559 

.8143 

1.4109 

1 .8320 

.9872 

.2358 

.3940 

.5286 

1.2084 

1.1892 

1.1771 

.2199 

.3953 

.4445 

1.1270 

1.0031 

e = .8, 

X = .4214 

X'  = 

.3097 

1.3980 

.1148 

.3597 

.7869 

1 .4676 

2.2599 

1.8158 

.1258 

.2451 

.5062 

1.0958 

1.4535 

1.4117 

.1404 

.2546 

.3482 

1.2230 

1.0385 

c = .5, 

X = .5996 

X'  * 

.2567 

1 . 7500 

.1264 

.3827 

.8697 

1.5118 

2.1659 

3.4468 

.1864 

.2532 

.9260 

1.4749 

2.3062 

1.7509 

.1578 

.3438 

.4015 

1.2491 

1.3581 

5 

c = 1 .0 

, X = .5996 

X'  = 

.1511 

2.0000 

.0674 

.2222 

.8413 

1.5820 

2.7259 

3.3785 

.0902 

.1405 

.8102 

1.3382 

2.6248 

2.0003 

.0972 

.2317 

.3087 

1.4427 

1.6493 

: 


Table  II 


(Continued),  Table  II. 2 
a2/n  = .5,  (a,g2)  = (0,  .5) 


e = .2, 

X = .3169 

X'  = 

.5676 

.4188 

.2880 

.6136  1.0 

1 .9363 

2.1014 

0. 

.3169 

.3169  1.0 

1.1002 

2.1014 

.8054 

.2930 

.4209  .4759 

1.0171 

1 .3283 

A = .5 

. 

£ = .4, 

X = .3169 

X'  = 

.4624 

.6014 

.2386 

.6372  1.0 

2.0105 

2.2444 

0. 

.3169 

.3169  1.0 

1.3281 

2.2444 

.8615 

.2443 

.3472  .4456 

1.0236 

1.0957 

- - 

e = .3 

X = .5858 

X'  = 

.2885 

1.1469 

.1968 

.8594  1.0 

2.9828 

2.2499 

3.5136 

.2654 

.2881  .9866 

1.3485 

2.2197 

1.1771 

.2522 

.3886  .4447 

1.2814 

1.3487 

A = 1. 

n 

ii 

CO 

X = .5858 

X'  = 

.1416 

1 .3991 

.0928 

.7249  1.0 

5.2358 

2.8717 

3.1767 

.1131 

.1384  .9157 

1.2183 

2.6296 

1.4117 

.1562 

.2533  .3482 

1.6834 

1.8309 

t = .5 , 

X = .7793 

X'  = 

.4990 

1.7500 

.0803 

.8815  1.0 

8.6025 

2.4904 

SESl 

.1022 

.1025  .9999 

1.2727 

2.4902 

Riff 

1 

.1579 

.3541  .4015 

1.9662 

3.4554 

B 

m 

X = .7793 

X'  = 

.0412 

2.0000 

■SB 

.8205  1.0 

19.9395 

3.2399 

4.9393 

ra 

.0412  .9926 

1.2711 

3.2161 

1 

2.0003 

m 

.2533  .3087 

3.1252 

6.1546 

I 

I 


K 
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Table  II  (Continued).  Table  II. 3 
a2/n  = .5,  (u,p2)  = (0,  1) 


.2,  A = .2763 


.8054 


1 .1441 
2.4180 
1.1771 


1.3980 

2.5834 

1.4117 


1.7500 

4.7675 

1.7509 


2.0000 

4.2218 

2.0003 


= .2763 


2296 

.3541 

2763 

.2763 

2297 

.3568 

= .5205 


3 

.36 

0 

.35 

121  .1967  .6167 
238  .1887  .7833 
506  .2520  .3482 


e = .5, 


066  .2081 
517  .1572  . 

592  .3487  . 


1493 

.li 

1659 

.0 

1979 

.2' 

1.3958 

2.1014 


1 .2814 

1.0311 

1.2034 

2.2444 

1.0002 

1.2913 



1.0278  1.4851 

1.1500 

1 .9532 

1.1416 

1.1106 

1.0427 

1.7711 

1.1044 

2.2494 

1.3437 

1.3356 

) 

= 

.1573 

1.3237 

1.8280 

1.4228 

2.4834 

1.4929 

2.2177 

A 

* _ 

.0771 

1.3244 

2.2402 

1.3379 

3.1020 

1.9879 

3.1959 

t.  • 


Table  II  (Continued),  Table  n.4 
o*/n  • .5,  (a,62)  - (0,  2) 


• * 

A - .2272 

A1*  ■ 

.6861 

.2161 

.2251 

. 3855  1 . 0 

1.6970 

2. 1014 

0. 

.2272 

.2272  1.0 

1 . 0091 

2.1014  1 

.8054 

.2417 

.4346  .4759 

1.0739 

1.9131 

A ■ .5 

! » .4. 

\ .2273 

\ ’ ■ 

.6033 

I 

1 

.51:17 

.201 1 

.5377  1.0 

2.3667 

2.2444 

0. 

.2272 

.2272  1.0 

1.1298 

2.3444  | 

.8615 

.2061 

.3700  .4456 

1 .0249 

1.6289 

e • .3, 

\ .4363 

V - 

.4529 

1.1409 

.2058 

.8343  1.0 

2.1109 

2.3499 

1.1754 

.2212 

.3952  .4452 

1 .0744 

1 .0016 

1.1771 

.2210 

.395?  .4445 

1.0739 

1.0000 

A * 1 . 

« * .8. 

A • .4363 

A*  • 

.4895 

1.3968 

.1205 

.7177  1.0 

2.9617 

2.8717 

1.8999 

.1230 

.2423  .5397 

1.0205 

1 . 5498 

1.4117 

.1400 

.3559  .3482 

1.1686 

1 . 0562 

1 

e ■ .5, 

A » .6135 

\*  • 

.3482 

1.7459 

.1269 

.8574  1.0 

3.4916 

2.4904 

3.5599 

.1833 

.2456  .9406 

1.4440 

2.3425 

1 . 7509 

.1563 

.3460  .4015 

1.3312 

1 .4091 

A • 1.5 

r ■ 1 , 

A » .6135 

\*  - 

.1489 

2.0000 

.0679 

.7615  1.0 

5.4734 

3.8399 

3.4159 

.0878 

.1391  .8201 

1.2924 

2.6572 

2.0003 

.0967 

.2353  . 3087 

1.4249 

1.6915 
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1.9  An  example  and  conclusion 

After  we  study  the  tables  computed  in  Section  1.8,  some  trends 
can  be  found: 

^ 0 

1.  If  (R2)  = (j),  then  the  Bayes  rule  performs  very  well  in  terms 

a n 

of  sup  r(i,iSR)  and  sup  r(t,dg);  this  is  because  when  ( 2)  = (V), 

icr  rte*  “ 1 


a,-art  3 


7 lxl'xol » bence  Bayes  rule  has  the  same  form  as  r-minimax 


rule  and  the  minimax  rule. 

2 

2.  If  fi  f 1,  the  Bayes  rule  does  not  perform  as  well.  This  was 
shown  in  formula  (1.8.2)  and  (1.8.3).  One  can  also  find  that  when 

V * ,5,  ^ = A = 1*5,  and  e * 


sup  r(t  ,(S„) 
Ltr  8 

sup  rrf,«ry 
t cr  ” 


19.9395 


This  means  a large  increase  in  loss  will  occur  if  we  need  to  consider 

sup(r,.sR)  instead  of  r(rR,0. 

t«r 

3.  r-minimax  rule  is  robust  in  terms  of  sup  r(t,8R)  if  x^  and  x,' 

are  close  to  each  other.  This  is  because  sup  r(t,<$r)  * max  (A,B), 

vCO* 

but  sup  r(v,iSR)  - X|A  + XjB.  Also  from  the  tables,  for  all  X and  X', 

r(TB’lV  - 2 V'V* 

4.  Minimax  rule  in  general  performs  fairly  well. 


5.  r-minimax  rule  is  not  necessarily  better  than  the  minimax  rule 


in  terms  of  the  Rayes  risk.  This  means  that  when  the  decision  depends 
on  full  information,  sometimes  incomplete  information  is  worse  than  no 
information. 


"*”r-»T»Tr 


(>!) 


6.  When  e gels  larger,  all  risks  become  smaller. 

Example  1.9.1:  A company  has  type  Jig  machines  to  produce  part  P(p) 


[p  is  the  diameter  of  P]  and  p|11g  ~ N( 0^x1 0~2  in.,  lxlO'4  sq.  in.). 


However,  the  same  company  also  has  type  , n^.  n3  machines  which  produce 

p A 

part  Q(q)  and  q | IT^  ~ NfO^xlO’^  in.,  1x10'  sq.  in.).  P and  Q are  matched 


if  |p-q|£.045  in.  Since  1 0^ -0q| <.  1.5  > Pf | p-q ) <4. 51  = P [ 


-4.5-(o.-o0) 


/2 


4 . 5-  ( 0 . - 0-. ) 3 

< Z < ]>  P[ — < Z < — ] = .98,  so  we  can  define  II.  as  good 

" ~ n ~ 1 


for  IIq  iff  I °i -0o ^ Similarly,  we  would  like  to  define  IK  as  bad 

for  rig  iff  |0.=0gj  > 2.5.  The  company  claims: 

P[|ere0l  <1.5]  = .78  , P[|e1-e0|  >2.5]  = .04 

P[|02-0g|  <1.53  = .71  . P[IVeol  - 2-5-  * *0a 

P[|e3-e0|  <l.5]  = .61  , P[|e3-e0|  >2.5]  = .15. 

Now,  the  company  has  machines  ag,  a^,  a2,  a3  for  sale,  where  a^tiK,  for 

i = 0,1, 2, 3.  If  we  are  allowed  to  take  5 sample  parts  from  each 
machine,  which  machines  to  produce  part  Q should  we  buy? 


Solution:  Let  Xg,  X^ , X2>  X3  be  the  mean  observation  from  ag,  a^ , a2,  a3, 

2 


respectively.  Then  — , A = 1 .5,  c = 1 .0,  and  if  we  decide  to  use 


r-minimax  rule,  the  table  1.2,  1.3  and  1.4  Indicate 
is  good  for  ag  iff  |X^-Xg|  £ 3.1757 

a2  Is  good  for  ag  iff  |X2-Xg|  £ 2.8887 

and  a3  is  good  for  ag  iff  |X3-Xg|  < 2.5663  . 


* 
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If  we  feel  the  claims  made  by  the  company  may  not  be  correct  and 
we  would  rather  assume  there  is  no  prior  Information,  then  we  will 
decide  to  use  minimax  decision  rule.  Then 

ai  1s  900d  for  aQ  iff  1x^1  <2.0  for  all  i=l  ,2,3. 

If  from  another  source,  we  know  more  informations  such  that 

°0  ~ N(c’  °l  ~ N^c’  -5)»  e2  ~ N(c,  1),  and  e3  - N(c,  2),  for  some  c, 
then  we  would  like  to  use  the  Bayes  rule.  So 


. , , ,c4  + M-5)  + xnO> 

a]  Is  good  for  aQ  iff  2^ 1 5 0 | < 2 

r + .5  k + 1 


i.e. 


2c  + 5 X,  c + 5 X, 


a2  is  good  for  aQ  iff  \J2  - 70|  <2.40 


0. 


< 2.0 


c + 10  c + 5L 

and  a^  is  good  for  ag  iff  | — ^ L . ^ - Q 


< 2.0. 


If  we  suspect  the  definiteness  of  any  prior  information,  we  may  then 
use  the  rule  which  is  most  robust  to  the  assumption  of  the  prior  distri- 
bution. So  from  the  table  we  use  r-mlnlmax  rule  on  a^,  use  Bayes  rule 
on  a,,,  and  use  the  minimax  rule  on  a^. 
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CHAPTER  II 

-MINI MAX  RULES  FOR  SELECTING 


THE  t-BEST  POPULATIONS 


2.1  Introduction 

In  this  chapter,  wo  continue  further  investigations  of  the  r-minimnx 
procedures.  The  problem  considered  here  is  to  select  the  t-best  popula- 
tions out  of  k populations  for  some  fixed  t < k.  Deverman  and  Gupta 
(1969),  Carroll,  Gupta  and  Huang  (1975)  have  discussed  this  problem  under 
the  subset  selection  approach.  Carroll  and  Gupta  (1977)  also  provided  an 
algorithm  which  can  be  used  to  compute  the  ranking  probability 

P((X] X^)  < (Xt  +1 Xt  )<...<  (Xt  +1,...,X^)),  where  X^  has 

pdf  ffx-ep  and  o,  - ...  - e < o , - .S.*ot  < ...  < o t,  ■ ...  -o, 

II  l S 

For  the  problem  of  selecting  exactly  t population,  Bahadur  and  Goodman 


(1952)  and  Alam  (1973)  have  shown  some  optimal  properties  of  the  natural 
selection  procedure. 

In  Section  2.2,  it  is  shown  that  if  the  populations  have  PF0  densities, 
then  the  natural  selection  rule  is  a r-minimax  rule.  This  result  is  also 
extended  to  the  case  when  the  populations  are  not  required  to  be  indepen- 
dent but  have  some  particular  form.  This  is  done  in  Section  2.4.  In 
Section  2.3,  our  goal  is  to  rank  the  k populations  through  a simultan- 
eous selection  of  the  t-best  populations  for  all  1 s t < k - 1.  In  order 
that  a r-minimax  rule  can  be  obtained,  we  need  to  change  the  loss  function 
we  used  in  Section  2.2  slightly,  so  that  an  indifference  zone  is  allowed. 
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The  result  obtained  in  this  section  justifies  why  we  adopt  midrank  for 
tied  data.  In  the  last  section  of  this  chapter,  the  result  of  Gupta  and 
Huang  (1977)  is  generalized  and  it  is  shown  that  the  I’-minimax  rule  for 
selecting  the  best  population  can  be  found  even  if  the  populations  are 
not.  independent.  We  also  prove  a lemma  which  will  help  us  to  find  the 
I' -minimax  rules  for  testing  hypotheses  about  multinomial  distributions 
and  multivariate  negative  binomial  distributions. 


2.2  Selecting  the  t-best  populations 

Let  11^ llk  be  k independent  populations  with  n.  associated 

with  distribution  function  F^(x)  = F(x-e^),  where  Gi  is  unknown  for  all 


i = 1,  2,  ...,  k.  Denote  by 


the  true  (unknown) 


ordering  of  the  parameters.  Let  t < k,  then  we  say  that  n.  is  among 


the  t-best  populations  if  0.  s We  wish  to  select  exactly  t 

populations  such  that  any  of  them  is  among  the  t-best  populations.  The 


problem  will  be  formulated  as  follows: 


Let  x = (x  = (Xj,...,x^)  | -«>  < x.j  < <»  for  all  i = 1,  2,  ....  kl  and 
H Id  1 lO-j  - •••0k)  | -«  < 0^  < °°  fur  all  i = 1 , 2,  . . . , k).  Also  let 
K 3 (l,2,...,kl  , T 3 0,2,...,t}.  Let  S = (s  | s : T -*•  K is  1 - 1 


function  and  s(i)  < s(j)  if  i < j),  then  for  each  stS,  . 


* i's(l ) ’ ' ’ * * 1 1 s ( t ) ^ denote  a possible  choice  of  the  set  of  t-test 

* k 

populations.  It  is  clear  that  S contains  (t)  elements,  we  will  denote 

them  by  s^ , s? sr,  where  r = (t).  In  view  of  the  definitions  given 

above,  we  use  the  Rorel  o-field  for  <=•  and  X,  and  the  discrete  o-field 


for  S. 


Definition  2.2.1. 


A measurable  function  <j>  • X v S * [0,1]  is  a selection 
rule  if  for  each  x >.  X,  we  have 

>:  4>(x,s)  = l . 
s^S  ' 

It  is  understood  that  ^(x,s)  is  the  conditional  probability  of  selecting 
*ns(l  )’• • * ’ns(k)* ’ hdvin9  observed  x.  For  1 v i > k,  let 

S.  1 = {s  t S | i v.  s(T) } 

S.2  » (s  t S | i 4 s(T)  I , 

then  S-1(S12)  is  the  collection  of  all  subsets  of  size  t which  in- 
cludes (does  not  include)  n^.  For  each  given  4. , we  have  the  following 
definition. 

Definition  2.2.2.  The  k functions  defined  by 

Mx)  = 2:  <Hx,s),  i = 1 , 2 k, 

(2.2.1) 

are  the  individual  selection  probabilities;  «K(x)  is  the  conditional 
probability  of  including  population  IK  in  the  selected  subset  having 
observed  x.  Tt  follows  that  6.(x)  satisfies: 

(i)  0 < i ( x ) <1  for  all  1 < i s k,  and  all  x. 

k k 

(ii)  r s (x)  * r r.  *(x,s)  • r r *(x.s) 

1*1  1*1  stSn  ~ s^S  iv.s(T) 

“St  4> ( x ,s ) = t,  for  all  x. 

Sv.S 


(2.2.2) 


(2.2. j) 


I 
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i 

Now,  we  have 

'i 

Lema  Given  <S(x)  » (fi1(x),...,«|({x))'  satisfying  (2.2.2),  (2.2.3),  • 

there  always  exists  at  least  one  selection  rule  $ such  that  (2.2.1)  holds. 


Proof:  (2.2.1)  defines  a simultaneous  linear  equation 


Ak*r^?Vxl  = ~^kxl 


(2.2.4) 

where 

$(*)  ~ (4*(x»s^),  ^ ( x , $2 ) , . . . , «t>( x ,sr ) ) 


A = (a..)  is  the  matrix  with 

* J 


aij  = 


r ° if 

i 1 if  it  Sjd) 


for  all  i = 1,2,... ,k. 


It  is  understood  that  for  x fixed,  6(x)  is  just  a vector  in  Fk  and 

>!•(*)  is  just  a vector  in  Fr.  For  simplicity,  they  will  be  denoted  by 

v =(v]  *v2' ■ • • ,v|<)  an<i  u = (u]  »u2» • • • *ur) • respectively.  Now,  consider 
. k l 

V = fv  I t,  v.  = tl  O [0,1]  , then  V is  a closed  and  bounded  convex  set. 
i=l 

For  v t V,  wlog,  we  can  let  v = (1 ,1 ,. . . ,1 ,a1 ,. . . ,a^,0,. . . ,0)  where 
1 > a^  i ...  i a^  > 0.  If  l f 0,  let  = 1 - a^,  and 

' = l^rl  A r2^’  t*ien 


- = 2° Ua1+f,a2-e,....,a2m+1+£,a2  +t,a2  -2.  ) 


+ 2(l,...,l,are,a2+e, a?m+r,  ’a?m+?‘l  ’d?m*j+2, ) 


2m+l  2m+2  ' 2m+3' 

2m+r'  ,a2m+2‘!  'd2m+3T 


if  8 = 2m  + 3 . 


Hence  if  8^0,  v is  not  an  extreme  point  of  V.  This  shows  the  ex- 
treme points  of  V are  the  permutations  of  ( 1 ,1 , . . . , 1 ,0, . . . ,0)  * 

(with  t l's),  and  they  are  just  the  columns  of  A.  Since  points  in  V 
can  be  expressed  as  a linear  combination  of  its  extreme  points,  this 
proves  that  equation  (2.2.4)  has  at  least  a solution  which  is  also  a 
selection  rule. 

Lemma  2.2.1  does  not  exclude  the  possibility  that  more  than  one 
selection  rules  may  have  the  same  individual  selection  probabilities.  For 
example,  when  k = 4,  t = 2,  let  1^  > II2  " li3  - n and  if 


4y  selects 


<}>2  selects 


{(nrn2) 

<"2-V 

<V'4> 

1 <vv 


with  probability  ~ 
with  probability  ~ 
with  probability  y 

with  probability  y 
2 

with  probability  y , 


then  <j>.j  ^ 4>2;  but  they  have  the  same  individual  selection  probabilities. 

is  better  than  4>2  in  the  sense  that  4y  has  y chance  to  select 
the  true  2-best  populations.  However,  4>2  is  better  than  <jy  in  the 
sense  that  <f>2  always  selects  one  of  the  2-best  populations.  At  this 


76 


stage,  it  is  hard  to  judge  which  one  of  ^ and  ^ is  better  than  the 
other.  For  our  convenience,  we  simply  treat  them  as  equivalent.  Thus, 
we  can  consider  decision  rules  in  terms  of  the  individual  selection 
probabilities. 


Let  D = (6  * | 0 < 6.(x)  < 1 

V.  (S^x)  * t).  For  given  e > 0 and  s 6 S, 

- ina*  v 6,-  +cJ-  Then,  0 *-  [ U 0c]u®n, 
itfs(T)  s€S  s u 

Note  that  0f  = $ if  1 ? j. 


for  1 < i s k,  and 

let  0 ■ {0  € e | min  G. 

' 1€s(T)  1 

where  en  = 0 \(  u 0 J. 
u s€S  s 


Definition  2.2.3.  For  any  s t S,  Let  As  t [0,1]  be  given  and 


A < 1.  Then 
stS 


r = { t | t 


is  a prior  distribution  on  0 ind 
<Mo)  * A$  for  all  s t S } . 


Definition  2.2.4.  For  any  0 €0  and  <S  t D,  the  loss  function  L is 
defined  as 

k , . 

L(0,.s(x))  * £ r LlS,(0,6.(x)), 

Sts  j=l  ~ J ' 

whore 


L(s)(M.(x))  = 


0 

for  all  j if 

0 f 0 

Ls,(l-Sj(x)) 

for 

J t s(T) , 

G t 0 

Ls2  Sj(?> 

for 

j if  s(T), 

0 fc  0 

A similar  loss  function  was  considered  by  Gupta  and  Huang  (1977).  Let  us 

further  assume  that  IT.  has  pdf  f.(x)  = f(x-fl.)  and  let 
k 1 11 

f,.(x)  3 tl  fjx.).  Then,  we  have 
i = l ’ 1 
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r(T'-) ' si  k k jf,  d?  *<•> 


sts  k j«s(T)LsI  * Ls2  E§Cj«sm  V?)]  ‘ Ls\  E?[j’:s{T)V*1]d'(0) 


= l 


(2.?. 5) 


Now,  we  can  prove  the  following  theorem: 


Theorem  2.2.1.  If  for  all  s € S,  there  exists  a 0*  € ^ such  that. 


sup  E [ T <$°(X)]  - E0*  [ E «?(X)] 
§tws  ~ jf£S(T)  J ' ?s  j(?S(T)  J ' 


and 


Ski  6?(-x»  • E?r  Vs.T, (-x,]  • 


where 


s°M  - 


and 


1 if  Nj(?}  < N[t](^ 


< r (x)  if  N • ( x ) = Nftl(x),  s r.(x)  - t - t. 

; J m ~ j-1,+1  J ' 1 

^ 0 if  Nj (x)  > N[t](x) 

(2.2.6) 


N[n(x)  < ...  < N[ti+1](x)  = ...  = ?i[t](x)  = ...=  N[t^(x)<  ...<N[k](x) 
is  an  ordered  permutation  of  ^(x),  j = 1,  ?,  ....  k,  and 


Nj(x)  = Mj2(x)  - M j | ( x ) , where 


Mj2^  S^s  Ls2  As  f 0*  Mjl^  = $^s  LslAs  f§*  t*) 

Then,  fp  = (6^,  — ,6^)  is  a r-minimax  rule. 


Proof:  Let  in  be  such  that  P [0  = 0*]  = \ for  all  s € S, 

T0  ~ ~s  s 

then  for  all  6 t D, 

sup  r ( t , 6 ) > r ( tq , 6 ) = Z A f . 2 L(s)(0* fi.(x))  f.*(x)  dx 

xtr  - u ' stS  S JFk  j = l ~s  J ~ ®s  ' ~ 

= £ f k ( £ + E ) [L(s)(0*,6.(x))  A f *(x)]  dx 

j = l ¥ stSjl  stSj2  ' 9S  ' ~ 

k 

= E k M.,(x)  + N . ( x ) 6 - ( x ) dx 

j=l  JKk  Jl  ~ J - J ~ - 

- f k Mn(x)  + Mx)  5?(x)  dx 

, k 

= E A$  j k Z L(s)(e*  6*?  (x) ) f * (x)  dx 

stS  JFk  j = l s J ~ §s  ~ 


>:  a { z 

stS  jts(T) 


Lsl  + L52E!|tj^(T)  5jP)] 


- LslE8.[  1 «,(*)]  » 

sl  ?s  jts(T)  J ~ 


£ E 

stS 


L E L .+  L ,E  [ Z 6?(X)] 
% jts(T)  Sl  s2  5 jtfs(T)  J ~ 


- LslE  [ Z 6°(X)]di(0) 
51  ? j*s(T)  J ' 


r ( 1 . « ) 


for  all  r t r . 
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So, 


sup  r(x,6)  > sup  r(x,SU)  . 

x€r  ' x^r 

This  proves  that  6^  is  a r-minimax  rule. 


Let  us  take  o*  = (6-j  ,e2»  • • • »6k) » where 

9i  = °0+e  ^ ’ fcs(T)  and  9i  = 9q  if  i $ s(T).  (2.2.7) 

The  constant  Og  will  be  determined  later.  We  would  like  to  investigate 

N^(x)  s N.(x)  first.  We  find  that 

Nj(x)  > Ni (x ) 

if  and  only  if 

s S|s-i  <Lsl+Ls2>  ' 

If  (Ls -j  +LS2 ) As  = c,  where  c is  some  constant,  for  all  s L S,  then 


Nj(x)  > N. (x) 


f fn*(v)  > z fn*(x) 

S«n\sji  - 

fen+c^X|l) 

z n _ o . 

stS1!VSjl  lts(T)\fi}  fo  (,t> 


f90*e(xj> 

TTTxTT 


z 


n 


f0g+e^xt^ 


VY  sfcSjl\Sil  ^s(T)\{j}  \{X9) 


But  for  all  st  S^XSjj,  (s(T)\{i))  U{j} 

and  vise  versa.  So  we  get  f . 

. °n+e  X* 

r n u 

seSil\Sjl  "\(X^ 


s’(T)  for  some  s'tS^\S.^; 


Z II 

sfeSj1\S11  ^s(T)\ijl  To0 


f0g+. 


f,rog~ 


i — - 
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Hence, 


N-(x)  < N . (x)  iff  g (x.)  > g (x.) 
1 ~ J ~ °0  1 0O  J 


(2.2.8) 


where 


9a  (*)  = 


V’ 


Now,  g is  increasing  in  x for  any  0 if  f (x)  has  monotone 
u0  v 0 

likelihood  ratio  in  x.  Then, 


xi  > x.  =>  Ni (x)  ^ Nj(x)  . 

It  is  well  known  that  if  X-|,...,X^  are  independent  and  the  density 

f0  (x)  of  X.  has  MLR  in  x for  all  i=l k,  then  E0[6(X)]  is 

increasing (decreasing)  in  e.  if  6(x)  is  increasing(decreasing)  in  xr 
(Lehmann(1959) ) 

We  can  now  state  the  main  theorem  of  this  section  as  follows: 


Theorem  2.2,2.  Let  X^,...,X^  be  independent  random  variables.  Assume 
that  X.  has  pdf  f0  (x)  = f(x-0^),  which  has  MLR  in  x.  Furthermore, 
if  for  all  sts,  (lst+i-s2^s  = c’  ^or  some  constant  c»  then 


('Sj . 1^2  * - ■ 

★ . 

. ,<Sk)  is  a r-mimmax 

rule 

, where 

r , 

i 

if 

x.  > x^ 

J 

*!<?>  ■ 

t,-t{ 

i 

if 

*j  ■ ‘[t] 

C o 

if 

X,  < ,W 
J 

(2.2.9) 


a 


ordered  permutation 


of 


x . 


Proof:  Let  0*  be  defined  by  (2.2.7),  then  (2.2.8)  holds.  Now,  we  let 


xi . = x^»  then  we  have 
J 


Hence,  N.  (x)<...<N.  (x)=...=N.  (x)=...=N.  (x'<.  . .<  N.  (v) 

1 “ ’t^+l  ~ ’t  ~ 't:  ' " 'k  ~ 


Now,  suppose  that 


N[,j(x)<...<N[ti+1  ..*[t](x)...-«[t2](!)-...<Ntk3(,)  , 

then 


If  in  (2.2.6)  we  let 


| 

for  j 

such  that 

xi  - 

t^l 

Xj 

> X. 

1 f 
tl 

r4<;>  ■ ■ 

) t-t{ 

- for  j 

such  that 

\ t'-t' 

1 C2  C1 

xi  - 

1 1^+1 

X 3 

J 

> X. 

■ V 

2 

( 0 

for  j 

such  that 

xi  - 

’t^l 

X . 

J 

■ x . 

then  6j(x) 

reduces 

to  <$j(x)  as 

shown  in 

(2.2.9). 

Now,  for  any 

stS, 

let  i ¥ s(T),  consider 

» x] ,. . . ,x^_i  ,X,j  »x^i , . • . ,X|^) 
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and 

X ' (xl Xi-1  *X1  ,x1+l  * — *xk>  * 

where  Xj  u x.'  , then  for  all  j t s(T)  , 


£ «!(x)  > Z 

jts(T)  - ~ jts(T) 


6*(x') 


which  shows  that  z S.(x)  is  decreasing  in  x.  if  i y s(T). 

* J^s(T)  J - 1 

Hence  i:  6.(x)  is  increasing  in  x.  if  i ^ s(T),  because 
j#s(T)  J ~ * 1 

v lSi(x)  + £ <s ( X ) = t.  Similarly,  we  can  prove  that  Z &* 

J'-sd)  J ' jKs(T)  J ' j*s(T)  J 

is  decreasing  in  x.  for  i t s(T ) and  Z 6*(x)  is  increasing  in 

jfes(T)  J ' 

x.  tor  i t s(T).  It  follows  that  Efl[  Z 6^{X)]  is  an  increasing 

? jfes(T)  J ' 

function  of  0.  for  i y s(T)  and  is  a decreasing  function  of  0^  for 

i t.  s(T).  Hence 


sup  EJ  e 


Now , 


jtfs(T) 

ft] 


V?)]  • sf  Ee*t  „£,t 

-«x0o<«-  ~S  jHs(T)  j 


iff  y.  = y 

Jl  < J 


[t] 


where 


yi  = xi 


and  the  distribution  of  Xj  - Oq  does  not  depend  on  0^  any  more.  This 

implies  that  E l 'S*(X)]  is  independent  of  the  choice  of  0_,  so 

-I  Ws(Tl  J 0 


V 

Hy  the  same  argument. 


sup 

'H 


■'*(*)]  * E„.t  £ «!<x>i 


^ jys(T)  J ~ ?s  j*s(T)  J 


inf  E0[  }:  d!(X)]  = E J Z 6*(X)]  . 

jv. s ( T ) ^ ~ s ifeifTl  J ' 


S jts(T) 


Hence,  A*  is  a f -minimax  rule  by  Theorem  2.2.1. 


(x) 


Remarks: 


We  are  considering  location  parameters  for  continuous  distribution. 


hence  the  probability  of  ties  among  x's  is  0.  This  means  that 


the  natural  selection  rule  (select  the  populations  associated  with 


the  largest  t ordered  statistics  among  X's)  is  a r -minimax 


selection  rule. 


Assume  are  the  observations  from  ft.,  and  X.  y X.  . 

1 1 in  l i n j _ ^ l j 


is  a sufficient  statistic  for  . In  this  case,  0.  is  still 


a location  parameter  for  5L  and  hence  the  r -minimax  rule  will  select. 


the  populations  associated  with  the  t largest  sample  means.  One  such 


2 2 

example  is  when  n..  ~ N(e^,a  ),  where  o is  known. 


The  condition  (Lsi+*-s;>)Xs  = c for  s t S holds  if  we  let 
L , 3 Lj,  L ^ and  As  = X for  all  st  S,  then  r reduces  to 

r.  = {r | dr(o)  = A for  all  s fc  S>.  I\  is  a small  class  of  prior 

A J ~ A 


distributions.  But  it  is  interesting  to  note  that  the  r-minimax  rule 


6*  is  actually  independent  of  A.  So  if  we  let  r.  = U where 

, 1 xcl 

I is  a arbitrary  subset  of  the  interval  J = [0,-s-  ],  then  .s*  is 

(t) 


a r-minimax  rule  for  r = I’,. 


The  loss  function  we  used  in  this  section  (see  Definition  2.2.4) 


satisfies  the  monotonicity  and  invariance  properties  of  Eaton's  paper 


(1967),  and  f^(x)  has  the  M-property  (which  is  equivalent  to  MLR 


if  X's  are  independent^  so  from  Eaton's  Theorem  4.1,  6*  is  a 
Bayes  rule  wrt  t for  any  t fc  P'  = {r|  t is  an  exchangeable  prior 


distribution  on  a } . Then,  6*  is  also  a r-minimax  rule  for  any 


r s;  r. 


I 


I 
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b. 


6. 


It  is  easily  seen  that  but  l ^Fj.  To  see  this,  let  k=2,  t=l, 

and  let  t N((q),(q  j^)).  Then,  TtfF'  . However,  pT[°i 
= P L [ -e]  = P[X^f],  where  X-N(0,3),  so  TfcPj.  In  this  sense, 
our  result  is  slightly  stronger  than  Eaton's  (1967). 

If  II.  has  a scale  parameter  0. , then  I^Q  roj(x)  with 

0^  0.  In  this  case,  we  might  like  to  define 

Mc  = t9|  (1-t)  min  9.  > max  0.)  and  r=(i|[  dx(0)  =X  for  all 

5 its(T)  1 i^s(T)  1 ^ s 

stS,  for  someXfcJ).  if  we  use  the  transformation  Y^  = fnX^ , we 

get  Y . ~g(y~n^ ) , where  g(y)  = eyf(ey)  and  = fnO^ , also 

r =ln|  min  n.  > (1+0  max  n. ),  where  1+e'  = . Now,  n. 

its(T)  1 ‘ i^s(T)  1 '"e  1 

is  a location  parameter,  therefore,  to  choose  the  t-largest  n. 's 

(hence  the  t-largest  0.. ’s),  we  will  select  those  populations  associa 

ted  with  the  t-largest  Y. 's  (hence  the  t-largest  X^’s). 


In  Section  2.4,  we  will  see  how  Theorem  2.2.2  can  be  generalised  if 
Xj  ,X,,, . . . ,Xj,  are  not  assumed  to  be  independent. 

2.3  Complete  ranking  and  simultaneous  selection  problems 

Let  l!j  ,11,,, . . . ,11^  be  the  same  populations  as  described  in  Section2.2 
and  * ,<()[k]  tfie  orcIer’n9  or  parameters.  Let  R:  0^  .n^, . . . ,n^} 

- (0,1 , ... ,k-lt  be  a 1-1  function  such  as  that  R(!K)  = j-1  iff  0^  =fl[j]- 
R(  II  ^ ) is  called  the  rank  of  il . When  Qj.Qg*.  ..^  are  un,<nown»  the  ranking 
problem  is  to  identify  each  population  with  its  rank.  A simultaneous 
selection  problem  is  to  decide  the  t-best  populations  for  all  lStsk-1  at 
the  same  time. 

Definition  2.3.1 . For  ranking  problem,  let  A={a|a  = (a(l ),. . . ,a(k))*  is  a per- 
mutation of  (0,1,. . . ,k-l  )*} . So  when  we  take  action  atA,  we  mean  population 
ll.  has  nink  a(i).  Notice  there  are  r=k!  actions  in  A,  which  we  denote  by 
a | * a ^ , . . . , a r . 


pHpagpmpppppi! 


KS 


Definition  2.3.2.  A measurable  function  S : X * A is  called  a ranking 

A 

rule.  A behaviorial  ranking  rule  is  a measurable  function  6 : XxA>[0,l] 
such  that  6(x,*)  is  a probability  measure  on  A.  Then, 

r a/ 

ci.(x)  = E 5(x,a0)a0(i)  is  called  the  rank  of  II.  generated  by  >s. 

1 ~ «=1  1 


In  the  following,  we  would  like  to  show  the  relation  between  the 
ranking  problem  and  the  simultaneous  selection  problem.  Achangeof  notation 
is  necessary  here.  From  now  on,  all  the  notations  used  in  Section  2.2  will 
be  added  a sub- index  t to  specify  that  the  selection  is  for  the  "t" 
best  populations.  For  example, 

k 

= = ( 5-j ^ . . . ,6^)  '\  0 s <$it(x)  < U E 6-t(x)  = t},  for  1 < t u k-1. 


Definition  2.3.3.  A general  selection  rule  is  a matrix  <S  * (<$j  ,<S2,. . . ,<s^ ), 

where  6 . = (6. .... . ,6.  .)'  D.  for  all  1 s j S k - 1.  For  any  x t x, 

J * J 3 ~ 

£(x)  = C<5-j(x)]i(x(k  -| ) , where  6^(x)  is  the  conditional  probability  of 
selecting  II.  as  one  of  the  j-best  populations  having  observed  x. 


Definition  2.3.4,  Let  6 = [6  — ] be  a general  selection  rule,  then 
j(7T  ~ 1J 

* E 6..  is  called  the  rank  of  n.  (i  = l ,2,. . . ,k)  generated  by  iS. 
1 j=l  1J  1 

Now,  we  can  prove  the  following  lemma  to  establish  a relation  between 
<S(x)  and  6(x). 


Lemma  2.3.1.  Let  6 be  a behaviorial  ranking  rule  and 
r „ 

«.(x)  = E 5(x,a0)a(>(i)  be  the  rank  of  II.  generated  by  <S,  then  there 

1 - l-l  ~ ^ A 1 k-1 

exists  a general  selection  rule  <S  such  that  the  rank  ifi.(x)  3 >•  'S<4(x) 

1 ' j=l 
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is  the  same  as  a.(x),  for  all  1 s;  i < k. 

Proof:  Since  x is  fixed,  we  wil 1 use  a.  for  a.(x)  to  simplify 

notation.  The  same  goes^for  ^ . 6,  6^  and  6.  Let  6(a£)  = 3 , 

then  0 s 3£  < 1 and  ^ S£  = 1 . Wlog,  let  a,  € A be  such  that 

dl(’)  ~ * ‘ 1 for  a11  i s i s k.  Then,  for  all  a£  t A,  a£  * P a^, 

where  P£  is  some  permutation  matrix  (2  st  * r).  Now  consider 


h - 


/ o 0 . . 0 \ 

/ 

. . . 1 1 

• o .*  * ; 

) * ik-i  = 

° i . i 

i i . . i / 

kx(k-l) 

V 1 / k-1 


then  it  is  easy  to  check  hence  P^l^  - a£.  Now, 

define  & = ^ 0£P£|1  , where  P,  = 1^.  if  we  ]et  p^  = [p^O] 
and  £=  then  we  have 


kxk 


r k 
Z B,  I 

<1=1  m=k-j+l  im  " £=!'"*• 


0 * *n  = £ B„  z p<*>  < r 3 = i 


and 


& ' ,.£,  etlkp,ill  ■ £ etlk§i  ■ !£{,  * 0.? k-1). 


I'.  It 


Thfk  proves  £ S,.  - j,  hence  6 0.  .here  Sj  . (6fj 

follows  that  s Is  a general  selection  rule  by  Definition  2.3.3.  Finally. 

-k-'  ■ *k)'  * i/Aiiik-i  ■ *,»a 

r 

= JlS1<5<5£)?£  * (a1 <V‘  * 


87 


hence  ^ for  all  1 < i < k. 

We  can  also  prove  that  given  a general  selection  rule  6,  there 

„ r ^ 

exists  a behaviorial  ranking  rule  6 such  that  z 6(a0)a0  = 61 . , . 

£=]  I 

The  proof  is  very  similar  to  the  proof  of  Lemma  2.2.1.  We  consider  that 

V = {61  k_-j  | 6 is  a general  selection  rule},  then  V is  a closed, 

bounded  convex  set  in  1R  . Now,  if  6 = [6..],  but  6..  are  not  all 

T J 1 J 

0‘s  and  l's,  then  6lk_i  is  not  an  extreme  point.  It  turns  out  that 

the  extreme  points  of  V are  P£§-| = a£,  so  for  all  61  k 1 £ V, 
r 

6lk_^  = Z Now»  set  5(|A)  = thus  completes  the  proof. 

From  the  above  discussion,  the  rank  of  n.  generated  by  behaviorial 
ranking  rule  6 or  by  general  selection  rule  6 can  be  treated  as 
equivalent.  In  the  follwoing,  we  will  consider  the  ranking  problem  through 
the  general  selection  rule  6,  i.e.,  we  would  like  to  select  the  t-best 
populations  for  1 s t s k - 1 simultaneously  and  hence  to  rank  the 
populations  in  some  order. 

Let  D = {6 1 6 is  a general  selection  rule}.  The  notation 

£ % 

Lt(e,6t(x))  and  rt(i,6t)  wil  mean  the  same  thing  as  L(e,6(x))  and 
r(t,6)  in  Section  2.2.  Because  t is  a variable  rather  than  a fixed 
integer  in  this  section,  a sub-index  t is  added  to  make  the  notations 
clear. 

An  intuitive  loss  function  for  the  simultaneous  selection  problem 
k-1 

is  Jl(9,6(x) ) = Z L.(e,6,(x)).  Since  the  loss  is  additive,  so  by 
t=1 

Lemma  1.7.1  and  Theorem  4.1  of  Eaton  (1967),  6*  = (6*,... »§k_i ) is 

Bayes  rule  for  any  exchangeable  prior  distribution,  where  6^  is  as 

given  in  (2.2.9)  for  1 s t a k - 1 . Then 

sup  r(-r,6*)  s sup  r(t,6)  for  all  6 £ D, 

T €r  * r€r 
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i-e.,  is  a r -minimax  rule  for  r = r'.  However,  when  we  use 
d(o,6(x))  as  our  loss  function,  we  find  that  there  is  no  indifference 
zone  for  6;  but  for  each  t (1  s t $ k - 1),  has  its  own 

indifference  zone.  If  we  feel  we  should  not  be  penalized  in  the  problem 
of  selecting  the  t-best  populations  (t  fixed)  when  the  t-best  populations 
are  not  distinguishable  from  the  others,  then  neither  should  we  be 
penalized  in  the  simultaneous  selection  problem  if  any  two  populations 
are  not  distinguishable  from  each  other.  Thus,  we  need  to  have  an 
indifference  zone  for  6,  which  is  done  as  follows:  Let 

C = {c | c is  a permutation  on  {l,2,...,k}  }, 
and  for  c £ C,  let 

®c  = {?l0c(l)  > 9c(2)  > •••  > 9c(k)  and  ec(i)  ' 9c(i+l) 


> e for  1 s i < k - 1 }, 


then 


(30  = u 0C  = (e| 


min 


I e,. -0 . 1 < e } serves  as 


c<C 


liisjsk 


an  indifference  zone.  Now  let  = (c|  i = c(j)},  then  for  all 


'ij 


c t C .j  and  8fcac,  8,  = 8c(j) 
largest  parameter.  Note  that  C 


,th 


1 u t s;  k,  1 s i s k. 


let 


is  true,  i.e.,  0.  is  the  j— 

k 1 

1 U C..,  for  all  1 s i s k. 

j->  u t k 

U 


For 

^it  = ^ ^ii»  = ^ ^i i * 

n j=l  1J  U j=t+1  1J 


then  for  9 0c  and  c fc  G^t  (Bit  respectively),  we  have 
0^  is  (is  not)  one  of  the  t largest  parameters.  Now,  we  can  define 
the  loss  function  as: 


& 
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Definition  2.3.5.  For  any  0 € 0 and  & € D,  let 


L(6,6)  = z *t(e,<5t(x)), 
t=l  1 - 't  ~ 


where 


At(0,«St(x))  = z Z l4c)  (0,6-f ( 
1 ~ i=l  ctC  1 - U 


L{C)(0,«Sit(x))  - 


0 if  0 $ 6C 

L] (1  -^i t(x) ) if  c 6 G.t  and  e fc  u3c 

l2^i t (x ) 1 f c t Bit  and  0 t 0 . 


Let  rx  = {t I jQ  dx(0)  = A for  all  c t C}.  We  see  that  is  an 
indifference  zone  for  6(x).  Now,  we  can  prove  Theorem  2.3.1  which  will 
be  used  to  find  a simultaneous  r-minimax  selection  rule. 


Theorem  2.3.1.  If  for  all  etc,  there  exists  a 0*  t e such  that 

~c  c 


& Vj-wi  5c(j)t  s°<jx 

0te  Vj-l  (~)]  = E^Cj«i6c(j)t:  ’ 


(2.3.1) 


where 


<?,<«> 


' if  "u<«>  ‘ N[t]t<x> 

t2 

rft(x)  if  N1t(x)  = Nj-t-jt(x),  with  l rit(x)»t-t| 
0 if  Nu(x)  > N[t]t(x) 


(2.3.2) 
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where 


N..(x)  = Lo  Z ffl*(x)  - L,  E ffl*(x) 


cfcBit  ~C 


cfeGit  2c 


If  Tn  is  defined  as  P [e  = e*]  = X for  all  c C,  then  we  have 

Tq  ~ ~C 

rt(T0,6t)  ^ rt(T0»<V  > rt(x,6G)  for  a11  6t  t Dt  and  T 6 rx’ 


Proof:  The  proof  is  similar  to  that  of  Theorem  2.2.1,  so  we  only  write 
down  the  main  steps  and  skip  the  details. 


k f 

.(x0,6t)  = Kt  + X ^ J^k  Nit(x)  6it(x)  dx 
* Kt  + AE  j k d? 


i=l  V 


rt^T0’~t^ 


= A E {L,t  + l_2  Efl*[  E 6"/iU  (X)] 
c6C  1 2 §c  j=t+l  C(j)t  ~ 


" L1  E0*I-jE1  6c(j)t  (*)J} 


> E f l.t  + L,  E [ E 6?f.vt  (X)] 
c€C  J®c  1 29  j=t+l  C(J,t  ' 

• Li  EeC  j,  s?(J)t  <*»  *<•> 


rt (t ,6^)  for  all  t € 
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where 


k f 

- X E L-.  Z dx 

1 i=l  h 1 c€Git  §c  ~ ~ 


This  completes  the  proof. 


Now,  for  all  c £ C,  we  define  0C  * (e^ ,0-,,. • • ,0^.)'  £ ac  as 
0c(j)  = 0O  - je  for  all  1 s j s k. 


(2.3.3) 


Again,  0g  will  be  determined  later.  We  would  like  to  examine 
N^(x)  s Njt(x).  We  find 


Nit(x)  s Njt(x) 


£ V(x)  i £ fA*(x) 
c«,«  2c'  *GJt  ~c 


<— > E 


fo*(x)  > I fA*(x) 


c€G.t\G.t  Hc~  cfeGJt\Gu2c 


Now,  for  all  c £ Gjt^Gjt’  1 = for  some  1 < t and  j=c(jO 

for  some  t + 1 s j'  s k.  If  we  let  c'  be  such  that  c"(£)  s c(fc)  if 
l f 1".  i t j'.  and  c'(i')  = j,  c'(j')  = i,  then  c' t Gjt\Git'  The 
correspondence  c +-+c'  is  1-1  between  Gu\Gjt  and  GJt\G,t.  So 
if  we  let 

9c(x)  = fe*(8)  - fe*  (x)  for  all  c t Git\Gjt. 
c * 

we  have 


Nn(x)  s Njt(?) 


z gr(x)  ^ o. 
ctGit\Gjt  C~ 
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Now, 


9c(x)  = c n f0  [fe  -iv<xi)  fe  -iv(xi) 

i-ti'.wy  0 e °n_1  e 1 en-J  c J 


V,'e<xJ)  Vj'e<X()]  • 


If  fQ  (x)  has  MLR  for  all  1 s i s k,  then  we  have 


xi  * Xj  ^ 9c<?>  - 0 for  a11  c e Gi^Gjt  » 
Nit(?}  * Njt(^  • 


(2.3.4) 


Theorem  2.3.2.  Let  .X^, . . . ,X^  be  independent  random  variables,  where 

X.  has  pdf  fa(x)  = f (x-0 . ) which  has  MLR  in  x.  Let 
1 °i  1 

...  -x[t]. ...  -x[t2] 


if  x..  > 


if  x.  = x^ 


(2.3.5) 


if  x.  < 


u 


I 


Proof : Let  6*  be  defined  by  (2.3.3),  then  by  (2.3.4)  we  see  that 
(2.3.5)can  be  considered  as  a special  case  of  (2.3.2),  as  was  shown  in  the 
proof  of  Theorem  2.2.2.  Also,  by  an  argument  similar  to  that  in  the 
proof  of  Theorem  2.2.2,  we  can  prove  that  (2.3.1)  holds  for  any  choice  of 
0q.  So  by  Theorem  2.3.1,  we  get 

inf  rt(T0*6t)  - rt(T0’6t}  - sup  rt^T,<SP 
*t*Dt  ~ *rx 

for  all  1 s t < k - 1.  Hence, 

k k 

inf  r(xn,6)  = inf  E r (xn,Sf)  2 1 inf  rt(Tn»§?) 

6€D  U ~ 5€D  t=l  1 t=l  6,,€0*  1 0 l 


— t 


K * * 

i E rt *'t ^ " r ( Tq , 6 ) . 


sup  r(x,6*)  = sup  E rt(x,6*)  < E sup  r (x,$J 
rfen  ~ i€n  i=l  Z i=l  x€l\  r 1 


S E rjxn,6*)  = r(xn,6*)  < inf  r(xn,6) 
i=l  1 U 'Z  U 66D  U ~ 


< r(xn,6)  < sup  r(x,6) 
° *r* 


for  all  6 t D.  So  6*  is  a r -minimax  simultaneous  selection  rule  for 

r - tx. 


Corollary  2.3.1 . Let  I c [O.^y]  and  rj  = u , then  6*  is  a 
r-minlmax  rule  for  r = rT. 
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Recall  that  our  main  purpose  in  doing  the  simultaneous  selection  is 
to  rank  the  populations.  When  there  are  no  ties  among  x^'s,  i.e. 


then  the  rank  vpi  (x ) generated  by  6*(x)  is  j - 1,  and  hence  n. 
0 

has  rank  j - 1.  If  ties  occur,  wlog,  let  us  assume  that 


I 
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So, 


k-1  * 

<Mx)  = E S.Jx) 
l - t=1  it  ~ 


k-1 


1+d, 

+k-1"tj‘dj 


if  ^ + dj  < ^ < tj+1+1  where  t0=d0=0 

if  t.  + 1 < i < t.  + d.. 

J J J 


<l>.|(x)  is  called  midrank  of  x and  this  justifies  why  the  midranks  for 
tied  data  should  be  used  for  rank  test.  For  use  of  midranks  for  tied 
data,  see  Lefctann  (1975). 


2.4  r -minimax  rules  for  hypothesis  testing  in  a multivariate  case. 

We  start  with  a result  in  Lehmann  (1955),  which  we  state  as  ?.  lemma 
without  a proof. 

Definition  2.4.1.  When  x = (x-j ,. . . .x^)",  x'=  (x^,. . . ,x£)',  we  define 
x s x'  iff  x..  < Xj  for  all  1 s i < k.  A measurable  set  S is  increas- 
ing iff  x € s and  x s x'  implies  x'  fc  S. 

Definition  2.4.2.  A family  of  distribution  t F0(x) said  to  have 
stochastically  increasing  property  (SIP)  iff  when  0 z 6'  and  S is  an 
increasing  set,  we  have  dF0(x)  s |$  dFQ.(x). 

An  example  of  SIP  family  is  when  F0(x)  = F(x-9),  i.e.,  e is  a location 
parameter.  The  following  lemma  is  due  to  Lehmann  (1955). 


i 


■ **  >*  . 
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Lemma  2.4.1.  Let  fF0^?^Ote  be  a of  distribution  with  SIP.  If 

6 is  a real -valued  function  such  that  6(x)  s 6(x')  for  x $ x',  then 
E()[cs(x)]  * eq.[6(x)]  for  e s er 

When  e is  a location  parameter,  the  above  lemma  can  be  generalized  to 
Lemma  2.4.2. 

Lemma  2.4.2.  Let  {F(x-0)}Q6e  be  a class  of  distribution.  If  6 is  a 
real-valued  function  such  that  6(x  + ta)  > 6(x  + sa)  for  t a s,  then 

so  1S  Ee+  ta  * E0+  sa[5(x)L  where  a an  arbitrary  vector 

in  Fk. 

Proof:  Let  A be  any  non-singular  matrix  with  a as  its  first  column. 
Let  Y = A ]X  and  6{x)  = 6(Ax),  then  when  X - f(x-e),  we  have 
Y ~ cf(x-n)  where  c = | det  A|,  f(x)  * f(Ax)  and  n = A-1e.  Also,  we 
let  g(e)  - E0[6(X)]  and  g(n)  = E [8(Y)].  Now, 

<S(x  + te^)  = 6(Ax  + tAe^)  = 6(Ax  + ta) 

£ 6<Ax  + sa)  = 6(x  + se^ 

for  t a s,  so  6 is  increasing  In  its  first  component.  Since  n is 
the  location  parameter  of  Y,  this  implies  g(n  + te^)  a g(n  + se^  if 
t > s.  But 

g(e)  = J<s(x)  f (x  - e)  dx 

= jfi(Ax)  f(Ax  - AA^e)  |det  A|  dx 
■ c Js(x)  f(x  - A-1e)  dx 
a g(A_1e). 
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hence, 


g(6  + ta)  * 8 + tA_1a)  = g(A_1e  + te^ 

> g(A’^e  + sej)  = g(e  + sa)  for  all  t * s. 


This  completes  the  proof. 


Remarks : 

1.  One  may  notice  that  If  FQ(x)  * F(x-0),  then  Lemma  2.4.1  is  an 
immediate  result  of  Lemma  2.4.2. 

2.  If  both  6(x)  and  g(e)  are  differentiable,  then  we  have 

k ~ k 

E a.  — 6(x)  > 0 =>  e a,  g(o)  * 0 for  any  a. 

1=1  1 ~ i=1  i 36i  - 


Example  2.4.1.  Let  the  random  variable  X has  pdf  f(x-e) 

* h(x)  c(e)  e®  * , where  0 € Fk  is  unknown.  Also  let  p be  any 
vector  in  Fk.  We  want  to  test 


Hq  : p'e  i c + e 


Hj  : p'e  s c 


Suppose  we  know  the  prior  distribution  of  0 Is  In  the  class 
r 3 {x|  Pt[0  e hq]  = x,  PT C©  £ Hi]  = X'},  where  Os  x,  X'  and 
X + X'  s 1.  If  the  loss  is  defined  as: 


(H0UHi )c 
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where  aQ  means  'HQ  Is  true'  and  a^  means  'H^  Is  true'.  To 
determine  the  r-minimax  rule,  we  proceed  as  follows: 


.£1 


Solution:  Let  * (6, . — .0. ) . on  « and  o, 

K ||  g||  1 ||g||c 

Let  rn  € r be  such  that  P [e  * 8n]  = A,  and  Pt  [6=6,]  ■A".  Also, 

l0  ~ ~u  tq  - ~ 1 

let  0 = {6 J 6 is  a measurable  function  on  Fk  such  that  6(x)  6 [0,1]}. 
For  6 t D,  6 ( x ) is  the  probability  of  saying  HQ  is  true  having 
observed  X = x.  Now, 

r(rQ,6)  = j LjAO-SOO)  f(x-0Q)  + L2A'6(x)  fU-e^dx, 


so  the  Bayes  rule  wrt  tq  is 


But 


60(?}  s I[L1xf(x-e0)iL2A'f(x-e1 )3  (?)  * 

II  BH  LoA  'c(e,) 

L,Af(x-60)  i L2i-f(x-e,)  <-  x-6  i J|L  tn  tprc'I^T  * k0  • 


Since  (x  + tg) 'g  ^ (x  + sg)'g  if  tjs,  so  6Q(x  + tg)  i 6n(x  + sg). 


hence  by  Lemma  2.4.2, 


E0+tg^O^X^  - ^ t i s 


‘0+sgLU0 


Now,  let  e t Hq,  then  e * vg  + ur  where  g"r  a 0.  Since 
g'Oi  c + e *>  v > ---%  , hence 

Hell2 


But 


W*»  > Ee0+ur[6°(?)] 


E0o+ur^6O^X^  = * k0  ■ ?o!  ■ u:V 

* P[X^g  ikn-  e^g]  where  XQ  - f(x) 


“0?  * 0 ?0r 

■ yso<?» 
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so  we  have  proved  that 

*h  w*-)]  ■ y vs’)  • 

~ 0 

Similarly, 

JJJ  Ee^tx)]  * E ^ C 60 ( ^) 3 • 

Then  we  have  for  all  t fc  r , 

r(t,db)  = |h0  Li(1-Ee[6o™  dT(®)  + 1h1l2e6C6o()1)]  dT(?) 

< XL^l-Eg  [«0(X)])  ♦ X'L2Eg.[60(X)3 

* r<vV  • 

Hence  <SQ  Is  a r-m-'nimax  rule. 

We  have  now  displayed  many  examples  for  deriving  theP-mlnlmax  rules 

(In  Chapter  1:  r-mlnlmax  rules  for  selecting  populations  close  to  a control; 

In  Section  2.2  and  2.3  of  this  chapter:  T-mlnlmax  rules  to  select  the  t-best 

populations).  One  might  have  noticed  that  the  common  setting  of  the 

T-mlnlmax  problems  we  have  considered  Is  that  we  have  a partition  on 
r 

such  that  d » ( U«.)  U , where  flu  serves  as  an  Indifference  zone,  and 
1-1  1 0 0 

T Is  defined  as  the  class  of  prior  distributions  which  put  some  known  mass 
on  each  (lsi<r).  Then  we  will  do  the  routine  job  of  choosing  a ^ J\ 

In  general,  Tq  Is  a degenerate  prior  distribution  which  puts  all  the  mass 
on  the  boundary  of  ^ . If  we  can  prove  that  is  also  the  least  favorable 
distribution  In  r for  Its  Bayes  rule  5q,  then  ^ Is  a r-mlnimax  rule. 


n ■ilijuj  lunii 
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All  the  problems  that  we  have  considered  so  far  are  under  the  assumption 
that  all  populations  are  independent,  but  this  condition  can  be  relaxed  in 
certain  problems.  Let  us  look  at  the  following  example  first: 


Lxample  3.4.2.  Let  X ~ Nk(0,E)  where  E = (1-p)  IR  + p J 1',  P is 
1 k 

known  and  p > r-V-  . Let  = ( U «.  )U&,  where  8.  ■ (e|  0,>max  0.+e>, 
f 1*1  1 0 k 1 ' 1 j^1  J 

! = L dT(°)  = ^ K with  X.  i 0 and  E X.  $ 1 being  given. 

J i 1 i=l  1 

Define 


L(0,6(x))  = 


L i(l-61(x))*£.  I 6. (x) 
i i - j - 


if  0 t ^ 

if  0 t 8., 


k 

where  6 fc  D = { 6 1 E 6.  (x)  * 1 and  6.  (x)  > 0}  . 

~ i=l  1 ' 1 

We  want  to  determine  the  r-minimax  rule. 


This  problem  is  known  as  the  selection  of  the  best  population  and  it 
was  considered  by  Gupta  and  Huang  (1977),  for  p = 0.  When  X^'s  are 
equi -correlated,  we  let  0*  = ( 6^, . . . , 0Q+e, . . . , ^)  for  1 <i  <:  k.  Also, 
let  t«  be  the  prior  distribution  such  that  P [0  = 9*]  = X.,  then  the 

U TQ  ~ ^ * 

Bayes  rule  wrt  fQ  is  6°  = ( 6® , . . . , 6®)  where 
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Now,  let  jf1  * [oiJ],  then 


m n^-Tiofn-pT  for  1 S 1 - k 


c1*  • 


-P 


[l+(k-l )p](l-p) 


for  1 f j 


then 


^0*(x)  > ^0*(x)  (Lj+fj)Xj 


(L.+  t,)X 

l^IxT  ' “Tl/VN ' “ 


e(o^-cr^ ) (x^-Xj)  > tn  c 


ij 


<_>  X)  . Xj>  tn  c(J  . 


We 


see  that  5^(x)  is  increasing  in  x^  and  is  decreasing  in  for 
j t 1;  also,  6?(x)  is  independent  of  the  choice  of  Oq.  Hence  we  get 

* E(0,...,0,c,0 


which  proves  that 


r(t,6°)  - | U^)  (1-E?[«°(X)])  dx(e)  s r(T0,60). 


hence  6 is  a r-minimax  rule. 


The  above  example  can  be  generalized  to  a more  general  theorem  which 
we  state  as  follows: 


Theorem  2.4.1.  Let  X has  pdf  as  f(x-o).  If  the  ratio 


f(x-ce) 


'y~ 


is  an  increasing  function  of  x.  and  is  a decreasing  function  of  x^, 
keeping  the  other  components  fixed,  then  the  problem  of  selecting  the  best 


c 


6JX)  3 


'l  - 


/ 


( 


nax  rule  = (6^, 

where 

if 

rtJ(x)>  ctj 

for  all 

j t l 

if 

r (x)  Z cBs 

for  all 

j t e and  '= 

* j ~ ej 

some 

if 

re/?)<  ctj 

for  some 

j 

Proof:  Use  the  same  argument  as  in  Example  2.4.1  except  the  monotonicity 
of  rij(x)  *s  now  guaranteed  by  the  assumption  instead  of  computation. 


Remarks : 
1 . 


The  monotonicity  of  r..(x)  is  satisfied  if  f(x-o)  = It  g(x.-e.) 

J ~ ~ ~ i*l  1 1 

and  g ( x- 0)  has  MLR  in  x. 


If  Yj  ,Y2> . . -Yk,Z  are  (k+1)  independent  random  variables  and 

oip(y) 

- 11  I V"  M.  I — I'  I M.  I II  I V I ^ 

1 


Y,  ' g(y-ih)  = c(04)h(y)e  ,r'  , where  p(y)  is  a strictly  increasing 


1'  ' v 

function  of  y for  1 s i s k,  and  Z is  an  arbitrary  random  variable 


with  pdf  as  q(z).  Now,  if  + Z,  then  letting  X *(X,,...,Xk), 


j 


w 


; 


Jtmd 
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«->  N.(x)  s;  Nj(x) 


If  X.'s  are  not  independent  but  X.  * ♦ Z with  Y.  and  Z as 

defined  in  remark  2,  one  finds  that  when  we  choose  0Q  * 0. 


(x) 


r.  Z p(xj 

e *s(T)  1 


qU)  dz  . 


Hence  (2.4.1)  holds,  and  Theorem  2.2.2  is,  therefore,  still  true. 

As  the  last  part  of  this  section,  we  would  like  to  search  for  the 
r-minimax  rules  for  some  hypothesis  testing  problems  when  X has  a 
multivariate  density  fQ(x).  but  e Is  not  a location  parameter. 

?•  $ ■ 3.  Let  X = (Xj,...,Xk)  has  pdf  fQ(x).  If  the  marginal 
distribution  of  (X2....,Xk)  has  pdf  g(ft  X|()and 

X]  t X2,-,*»X|C  ~ hn(e)(xi  I x2 • • • • •x|c) • where  n(§)  is  an  increasing 

function  of  and  h^Xj  | x? X|()  has  MLR  in  xr  Then  if  6(x) 

is  an  increasing  function  of  x.,  we  have  that  EQ[<$(X)]  is  an 
increasing  function  of  e.j . 

P-T-P-P-f-'  Let  p - (Oi  ,02» . . . ,0^) , 0'=  (0^,02, . . .^k)  with  0^  ^ 9^, 
then  n(e)  > n(0*).  so 

En(e)^?^  X2 Xk^  * X2» • • • »X|(] 

for  all  X2,...,Xk.  Now, 


V5<*>J  ' eo2 ekCEn(o)CMX)!  x2 xfc]] 


- Ee2,...,t>k 


= E0-[«(X)]  . 


Examples  of  f^(x)  satisfying  Lemma  2.4.3  are: 
1.  Multinomial  distribution  MN(n,o): 


V*>  = 


xi!)(n-iE1  xi)!  v"1 1 A 1=1 ) 


k xA/  k 

n e.  Mil  - e e 


k V"  = xi 


(2.4.2) 


(n  - T.  x.)! 
i*2  1 


x,!(n-  E x.)!  V 1-  E 0. 
' 1=1  1 \ i=2  1 


) \ '-tfj  - 


k x,f 

n 0 . Ml  - 


( n x . ! ) (n-  l X.)! 
1=1  1 i=2  1 


n-  I x. 
i=2  1 


We  find  that  n(o)  * ^ which  is  increasing  in  0^,  and 


1-  Z 0 . 


X,  | X,,...,X,.  ~ b(n  - l x . , n(e) ) which  has  the  MLR,  and  the  marginal 

=2  1 
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2. 


distribution  of  (X? Xj'  ~ MN(n,(e2, . . . ,0fc)-) , so  that 

Lenma  2.4.3  applies  for  this  distribution.  Furthermore,  by 
the  symmetry  of  the  density  of  multinomial  distribution,  we  get 
if  6(x)  is  increasing  (decreasing)  in  x . , then  En(6(X)]  Is 
increasing  (decreasing)  in  0^  for  any  1 i i ^ k. 

Multivariate  negative  binomial  distribution  MNB(n.e): 


(n+  z x - -1 ) ! 


V;>  ■ 


i=l 


n/,  . 


n e4  l ♦ e 0i 


(n-l ) ! n (x. ! ) 
i=l  1 


i = l 


**  Vi 

i=l  1 


(2.4.3) 


f (n+  Z x . -1 ) ! 
i=l  1 


T 


(n+  z x.-l )! 


x ' 


i=l 


1+  z e 


i=2  1 


f 


1 + 


-(n+  E x.) 


i=l 


\ 


'*  1$2®f 


(n+  E x.-l)! 
i=2  1 


n e. 


xi 


(n-l ) ! n x.! 
i=2  1 


i=2 


k 

1 + E 
i=2 


,-(n+  E xj 


i=2  1 


0 


We  find  0(0)  = k which  is  increasing  in  0, , 
1+  E 0.  1 

i=2  1 

k 

X-i  | X2»...,X.  ~ NB(n  + E x . , 0(0)),  and 

i=2  1 


(X2»...,X^)'  ~ MNB(n,  (©2» . . . ,0^) ") , so  the  same  result  for  multinomial 
distribution  also  holds  for  multivariate  negative  binomial  distribution. 


■ 

I 
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3.  Multivariate  normal  distribution  Nk(e ,z) : 

££,ni  ? «• 

E21  E22  J * \e2 J ' 

then 

?2  ""  ^k-l^?2,E22^  an<^  ^i  I ~2  ^ ^(®i+^2i^22^2~~2^  * 

„ -1 

°11  " E21 E22E22} 

So  Lemma  2.4.3  holds.  But  in  the  multivariate  normal  case,  since 
0 is  a location  parameter.  Lemma  2.4.2  is  stronger  than  Lemma 
2.4.3. 


Example  2.4.3.  Let  be  i id  MN(n,e) 
(2.4.2).  Let  0 = (e|0  < 0.  for  1 s;  i s k and 
We  want  to  test 

t 


with  pdf  as  in 
k 

Z 0.  < 1}. 
i=l  1 


t 

H,  : Z 0.  < a 
1 j=l  J 


where  t € {1 .2,  — »k>  and  0<a<a+c<l.  If  both  r and  the  loss 
are  the  same  as  in  Example  2.4.1,  we  can  proceed  as  follows  to  get  a 
r-minimax  rule. 


Solution: 


Let 


a+e  1 -a-e 

* T * OTT 


and 


?! 


a 1 -a  1 -a  . 

t * k+l-t  * **•  * k+l-t  ' • 


Let  xn  £ r be  such  that  P [e  = e„]  = X and  P [e  = e,]  = 
u T0  ~ To  ~ -1 

then  the  Bayes  rule  wrt  tq  is 


60(?}  = ^Laf  (x)  > L2X-ffl  (x)] 

?0  ~ c ?!  ~ 


(x). 


Now, 


LiAfe0^  1 L2x"fe,^ 


°i 


<«■  5 

j=1  v a 1-a-e  ’ LjA  v 1-a-c  > 


L„A' 


In 


rp  + n lnr^i 


l-a 


w 1 x,  > 


J-'  J ' **  *r * 


= c. 


Then,  6 Q(x)  = I t (x)  is  increasing  in  x.  for  all  1 < j < t. 

[ Z x,>c  ] " J 

j=l  J n 


Hence,  E0[6  (x)3  is  increasing  in  0.  for  all 
t - t J 

Z.  x.  - b(n,  Z 0.),  Efi[6n(X)]  depends  only  on 
j=l  J j=l  3 ~ 0 * 

Inf  Ee[«0(X>]  - E,  [«„<»)] 


1 S j St.  Now,  since 

I 0,.  So  we  get 
j=l  J 
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then  we  find 


f8  M 

fn  (X) 
?0  ~ 


= "n  5 Llf+eJjl+O 

^ 1+a  J j_i  a(1+a+e) « 


Hence,  if  rn  is  such  that  P [6  = 0rtl  = X and  P [0  = 0,1  = X", 

u Tq~  ~0j  Tql-  ~1j  ’ 

the  Bayes  rule  of  Tq  is 


= 


In  +n£n  -+^+e 


1 if  E x,  > 


L1X 


1+a 


= b. 


Since  E x.  ~ NB(n,  E 0,),  so  everything  is  the  same  as  in  Example 
j=l  J j=l  J m 

2.4.3,  i.e. , sup  r(r,6n)  = r(xn,6n).  If  we  consider  E x,  as  the 
r€r  0 0 °m  1=1  0 


sufficient  statistic  for  0 and  Ex,-  MNB(mn,e),  we  get  the 

1=1  -1 

t’-minimax  rule 


V?r*,*,5n)  = ]r  "j  l x > b i {?1 ?m}  ’ 

i=l  j=l  ij  ' mn 


where  x<  = (x^  , . . . .x.J  for  all  1 < i < m. 


ik' 
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CHAPTER  III 

EMPIRICAL  BAYES  RULES  FOR  SELECTING 
GOOD  POPULATIONS 


3.1.  Introduction 

We  assime  that  G Is  an  unknown  prior  distribution  on  fc,  and  de- 
note the  minimum  Bayes  risk  In  a decision  problem  by  r(G).  Robbins,  in 
his  pioneering  papers  (1955,  1964),  proposed  sequences  of  decision  rules, 
based  on  data  from  n Independent  repetitions  of  the  same  decision 
problem,  whose  (n+l)st  stage  Bayes  risk  converges  to  r(G)  as  n > »*. 
Such  sequences  of  rules  are  called  empirical  Bayes  rules.  Empirical 
Bayes  rules  have  been  derived  for  multiple  decision  problems  by  Deely 
(1965),  Van  Ityzln  (1970),  Huang  (1975),  Van  Ryzln  and  Susarla  (1977), 
and  Singh  (1977).  However,  the  forms  of  densities  of  the  populations 

n w 

that  these  authors  considered  are  either  c(e)h(x)e  , for  continuous 

case  or  c(o)h(x)ex,  for  discrete  case,  and  the  loss  functions  are 

either  squared  error  or  merely  max  e.-o,  type.  Fox  (1978)  discussed 

l<j<k  J 1 

some  estimation  problem  under  squared  error  loss.  In  which  empirical 
Bayes  rule  was  derived  for  the  first  time  for  uniform  distributions. 

Barr  and  Rlzvl  (1966),  and  McDonald  (1974)  also  considered  selection 
problems  related  to  uniform  distribution  by  the  subset  selection 
approach.  It  Is  Interesting  to  note  that  uniform  density  Is  a good 
approximation  to  the  central  portion  of  normal  density.  We  consider  one 


' < 


:■ 


* 


4 


\ 
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industrial  example.  One  often  wishes  to  keep  the  resistance  In  a 
circuit  constant.  If  the  resistance  Is  normally  distributed,  then  re- 
sistors with  resistance  In  1*  tolerance  Interval  about  the  mean  are 
selected  as  "high  quality".  In  this  case,  the  uniform  distribution  can 
be  used  as  a model  for  these  high  quality  resistors.  In  Section  3.2, 
empirical  Bayes  rules  are  found  for  selecting  populations  better  than 
a known  control  when  the  populations  are  uniformly  distributed.  In 
Section  3.3,  the  same  problem  Is  considered  except  that  the  control 
parameter  Is  unknown.  In  Section  3.4,  we  derive  the  empirical  Bayes 
rules  for  populations  with  densities  of  the  form  p^(x)c^(e^)I^Q  Q j(x). 

Rate  of  convergence  is  also  discussed  in  this  section.  Finally,  Monte 
Carlo  studies  are  carried  out  for  the  prior  distribution 


G(o)  = ^ c)(e).  The  sma^est  sample  size  N Is  determined  to 


guarantee  that  the  relative  error  Is  less  than  e. 

3.2.  Known  control  parameter 

Assume  that  we  have  k populations  n,  ,n»,.  • • ,n. . ~U(0,e^) 

and  Is  unknown  for  1 ^ 1 1 k.  Let  0g  he  a known  control  para- 
meter, we  define: 


Definition  3.2.1.  Population  Is  good  Iff  > oQ,  and  population 
n.  Is  bad  Iff  <_  eQ. 

Let  A » { 1 | > Oq } and  B ■ { 1 | <0^1,  then  A(B)  Is  the  set  of 
indices  of  good  (bad)  populations.  Our  goal  Is  to  select  good  popu- 
lations and  reject  bad  ones.  We  formulate  the  problem  In  the  empirical 
Bayes  framework  as  follows: 
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(1)  Let  fc  * » (©i  * * * * *0^)  I * 0 for  *11  1 < i < k}  be  the 

parameter  space. 

(2)  Let  W»  {S|S  c.  U, 2, •••,!(}>  be  the  action  space.  When  we  take 
action  S,  we  say  Is  good  If  1 € S and  Is  bad  If 

1 JE  S. 

(3)  Let  L:  b * j»!  +(0,»)  be  the  loss  function.  We  define 


M~'S)  ' L'  tJ\s(V6o)  * L?  iJns<V9|)' 


(4)  Let  G(<l)  = n G^o^)  be  an  unknown  prior  distribution  on  0, 


where  G^e^)  has  a continuous  pdf  g^(o^). 

(5)  Let  be  pairs  of  random  variables 

from  ilj  and  Yjjlojj  * ~U(0,o^)  for  all  1 < 1 < k and 

1 < J<n.  let  Xj  ■ then  Yj  denotes  the 

previous  j-th  observations  from 

(6)  Let  be  the  present  observation  from  n^,  for  all  1 < i < k. 


Let  X«(x  ■ (xj,***,X|c)|x^  > 0 for  all  1 <.  1 <.  k).  Also,  let 

im  <*1 y «*<*  since  the  ioss 


function  Is  bounded  from  below  and  we  are  Interested  In  the  Bayes 
rule,  we  can  restrict  our  attention  to  the  non- randomized  rules. 

We  have 

(7)  0 ■ {«|6:  1 -*■»  Is  a measurable  function).  0 Is  the  collection  of 
decision  rules.  Let 


r(G)  * Inf  r(G,5)  - r(G,6*), 
itD 


(3.2.1) 


1 
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then  6*  is  the  Bayes  rule  wrt  the  prior  distribution  6,  and 
r(G)  is  the  minimum  Bayes  risk. 

Definition  3.2.2.  A sequence  of  decision  rules  (6  (x.Y Y 11“  , 

. ’•'fi  n=l 

is  said  to  be  asymptotically  optimal  (a.o.)  or  empirical  Bayes  (e.B.) 
relative  to  G,  if 

r„<='V  s f cef  u«.«n(&.lt, 

* JL  * t* 

" r(G)  (3.2.2) 

as  n > ».  The  expected  value  In  (3.2.2)  Is  taken  wrt  Y,,***,Y  . 

'**1  Nl 

Remark:  For  simplicity,  • • .Xj  will  be  denoted  as 

from  now  on. 

Let  m^(x)  be  the  marginal  pdf  of  Xj  and  M^(x)  be  the  marginal 
distribution  of  X^.  Then,  we  have 

m.(x)  = jj-dG^(e^)  for  all  x > 0,  and 

■ fft » dGt(9)dt 

■ 00  6 dtdGf<9)  4 00  ? dtdG((#) 

= xm^(x)  + G^(x). 

Hence, 

Gj(x)  ■ Mt(x)  - xm^(x). 


(3.2.3) 


1 IS 


With  the  help  of  this  formula,  Me  are  able  to  get  a sequence  of  a.o. 
decision  rules.  As  the  first  step.  Me  Mould  like  to  find  r(J»)  and 
the  associated  Bayes  rule.  To  get  the  Bayes  rule  easily.  Me  Mill  change 
the  form  of  the  loss  function  to  the  folloMlng: 

L*9,s* " " Ll(ere0)I(e0,-)(81)] 


+ 1J1h(9i"<’o)I(eo,.)(0i: 


(3.?. 4) 


It  Is  easy  to  see  that  the  second  sum  of  (3.2.4)  does  not  depend  on  the 
action  S.  Hence,  to  find  the  Rayes  rule.  Me  can  omit  the  second  sum 
and  consider  only  the  first  sum  In  (3.2.4)  as  our  loss  function.  Then, 


* L . 1.  tf  4(v°i)fo(&)d£(^ 

U 1ta(£)  ~ 

- [6  ^ Li(ereo)V^d^]d,L* 


So,  If  4*(x)  « S*  Is  the  Bayes  rule,  then  Me  find  that  1 tS*  If 

L . ,4<eo'"t'o7  "W  i r >x 

; (0,e0Jit(x^,«>)  1 e0vxl  1 


Hence,  1 fc  S*  If 


(1)  x<  > 6g,  or 
(11)  x^  < and 


‘2°0  f e!  dW  “ L2CG1 (oq)-G1 (x1 )3 
1 


(3.2.5) 
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The  condition  in  (ii)  is  equivalent  to  H^Xj)  ic^Oq),  where 
r °°  , 

H1(x1)  “ L2e0  j e7  dGi(oi)  ♦ L2Gi (xi } • and 
X1  1 

ci(eo}  " L2Gi(0O)  + Ll(1-G1(0O»  - he0  f r dGi^ei>- 

e0  1 

Since  H^Xj)  is  decreasing  in  x<  for  x1  < eQ,  so  (i)  and  (ii) 
reduce  to  xi  > eQ-b1  where  b1  > 0 and  satisfies  H.fbj)  * c^). 
This  shows  that  for  any  (5,  Gupta  type  rules  are  Bayes  rules.  Now, 
since  G is  unknown,  the  Bayes  rule  is  not  obtainable.  To  find  a.o. 
rules,  we  need  to  estimate  £.  In  view  of  (3.2.3),  we  need  to  estimate 
M,.  and  m.. 

Let  {Yi}M  be  a sequence  of  iid  random  variables  with  a coercion 

distribution  function  K(y).  We  also  assume  that  K'(y)  « k(y)  exists 

a.e..  Let 

Kn(y)  = n J1I(-,y](Y1)  (3-2-6) 

and 

kn(y)  " RC^(y+h)-Kn(y)].  (3.2.7) 

Then,  *n(y)  -*•  K(y)  uniformly  in  y with  probability  1 (Glivenko- 
Cantelli  Theorem)  as  n «.  The  following  lemma  guarantees  the 
convergence  of  kn(y)  to  k(y). 


1 


f 
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Lemma  3.2.1.  (Parzen  (1962)) 

(1)  If  h * h(n)  In  (3.2.7)  satisfies  11m  h(n)  * 0,  then 

n-*» 

11m  E[k  (y) ] * k(y)  for  any  continuous  point  y of  k(  - ) • 
n-*« 

(11)  If  In  addition  to  (1),  h(n)  also  satisfies  11m  nh(n)  = «•, 

2 

then  11m  E| k (y)-k(y) | ■ 0 for  any  continuous  point  y of 

n-*« 

k(*). 


Remarks: 

1.  In  our  problem  m^(y)  * J jj-dG^(e),  hence  m^(y)  Is  continuous 

at  all  y.  So,  (1)  and  (11)  In  Lemma  3.2.1  hold  for  all  y. 

2.  By  Chebyshev's  Inequality  and  (11),  we  have 

Elk  (v)-k(v)!2 

lint  P[|kn(y)-k(y)|  > c]  < 11m  — ^ » 0. 

n-*<>  e 


Hence,  if  h ■+  0 and  nh  0,  It  is  shown  that  kn (y ) * k(y) 
In  (p). 


Now,  we  state  a theorem  which  provides  a sufficient  condition  for 
{6q(j()>^l  to  be  empirical  Bayes.  Let 

(3.2.8) 


and 


^(V  a H1(x1}  ' c1(e0) 


s0<5)  * ^1*1  < eo  and  &g/xi)  -0K 
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Now,  for  ary  1 (1  < 1 < k),  let  A^  n(x^)  “ A1^x1  •Y11 Y1n>  for 

all  n * 1,2,*«*,  be  a sequence  of  measurable  real-valued  functions, 
we  define 

sn(x)  * (1|x1  < e0  and  Aj^Xj)  <.0}  (3.2.9) 

and 

«*(x)  = {1|xi  1 0O)  (3.2.10) 

Then  we  claim 

Theorem  3,2.1.  If  for  1 < 1 < k,  |*  0^dG|(e^)  < °»  and 
Ai>n(xi)  -*■  Aq  (x,)  in  (p)  for  almost  all  x^  < eQ.  Then 
U*(x)}”ai  defined  by  (3.2.10)  is  empirical  Bayes. 

Proof:  For  all  S € W,  let 

= {x  6 X|x^  > eQ  if  1 € S and  x^  < eQ  If  1 % S). 

Now,  for  any  x 5*(&)  * S 0 SQ(x).  Hence,  for  x € il^, 

[^(0.6*(x))^(x)dG(eJ 


Veo 


1td*(x) 

a l [-(  Li(Veo)fe(£)dS,(&)]+  l ag  (x.)  n mJxJ. 

its  ®1  >®0  ° *•  1€Sq(x;  G1  1 J-l  j j 

Similarly,  for  x € we  have 


I B 1 1 1 1 


' I'HikiM 1 maM ifc,  * 
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feUi.«;(x)>yx)<iG(e) 

■ I [-[  Li(ei-0o)fe(x)d£,(®)]+  I V,  K*)  n 

its  'et>e0  1 1 0 & ltsn(x)  Gl  1 j-l  j J 


Hence,  If 


Ai  n(xi)  ^(x.)  in  then 

o i /fcL(9,j;(x))f,(iL)d!i(9>-{eL(e,5*(y  )fa(x)d£(£) 

k 

- I,  J aG  <x1>-a1  n(x1^  " mi(xi> 

1 tS  (x)  b1  1 1,n  1 j-1  J J 

" J^I 

k 

+(  l - l n(xJ  n m.(x .) 
«„(>)  tS0(x)  1 j-1  J J 

k 

+ JA1  n(x1}'AG  (x1*l  n m1(x1) 

1€S0(x)  1*n  1 Gi  1 j-1  j j 

° jlM 

k k 

< 2e  l n m.(x.) 

1-1  j-1  J J 
j*1 


with  probability  near  1 for  all  n > N.  Note  that  (3.2.11)  Is  non- 
positive by  the  definition  of  Sn(x).  Now,  we  have  proved  that 

f * | L<8,«*(x))yx)clG(9) 


(3.2.11) 


In  (p),  for  almost  all  x.  By  Corollary  1 of  Robbins  (1964),  we 
conclude  that  {4JJ(x.Y^ ,• • • ,Xn)>*.1  Is  empirical  Bayes.  Thus 
completes  the  proof. 
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We  have  reduced  the  problem  of  finding  the  empirical  Bayes  rules 
to  the  problem  of  finding  a consistent  estimator  of  Ag  (x^  in  (p). 
If  we  recall  that 

mi^xi^  = | eT'dGi^9i^  and  6i^x)  = H^(x)-xm^  (x), 
x.  i 

Then  from  (3.2.8)  we  get 

ag.(V  = 4mi(xi)(Vxi)  + + Li£V®0>-13- 

Hence,  if  we  define 


ai.n(*i)  ' „(*i * l2[Mt,n<xt>-Mt,n<80» 

*L,C«1,n(e0)-U. 


(3.2.12) 


where 


Mi,n(x)  = n ^ ^O.xj^ij* 


(3.2.13) 


mi,n(x)  = iS*Mi,n(x+h>-Mi,n(x>3  = L 1 ^x.xWV  <3-2-14) 

3 * 

then  by  Lemma  3.2.1, 

A1,n(x1)  - AGi(xi)  in  (P} 

for  all  x.  Thus,  the  sequence  of  rules  {«*(x))"=1  which  is  defined 
by  (3.2.10),  with  A,>n(x^)  defined  by  (3.2.12),  is  empirical  Bayes. 
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3.3.  6q  unknown 


In  this  section,  nQ  is  a control  population  which  is  distributed 

as  U(O,0q)  with  0Q  unknown.  Let  Yor'",Yon  be  the  past  data 

collected  from  nQ,  and  let  XQ  be  the  present  observation  from  rig. 

Based  on  this  further  information,  we  will  search  for  empirical  Bayes 

rules  for  selecting  populations  better  than  the  control.  Note  that 

k 

now  0 = (eo,01,...,0k),  x = (x0,x1,---,xk)  and  G(e)  = Gi (@i ) . 

When  the  same  loss  function  is  used,  the  Bayes  rule  6*  now  becomes: 
i t 6*(x)  if 


UJ 


51  (80-e()dG,(9,)dG0(90) 


xQ  o '(o,e0]n(x1t<»)  i 
lL'  r*0%  l(» 0.-)n(x1..)i7  (6('eo)dGt(9i)dGo(9o)- 

Hence,  i t 6*(x)  if 

(1)  xj  > xQ  and  aJqG^VV  - °’  where 

AGn,G.^x0,xi^  = (LrL2)Cf  mi(eo^dG0^eO^+{  m0^9i^dGi^9i^ 

V I A • A • 


- L1 [1-G. (x. )]m0(x0)+m. (xi ) [L2+(L1 -L2 )Gf)(xi )-L1G0(xQ)3, 


(3.3.1) 


(11)  x^  < Xg  and  Ag  q (Xq.x^)  £ 0,  where 
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AGQ6i(x0*X1)  " (LrL2)[lxm  1^O)dGO(0O,f/^  "V)(9f)dGl(0l)J 

- ^(xo)th+<L2-h^G1S)•4S(xt,>L^n^xt)^1-6o(xo),• 

(3.3.2) 

When  L]  * L?  * L,  the  Bayes  rule  Is  greatly  simplified.  Then  we 
have  1 t S*(x)  if 

aG0,G^x0’x1^  * •no(*o)n-Gi(Xi)]-m<(xt)[1-G0(x0)]  > 0. 

Now,  a consistent  estimator  of  AG^|G^ (x0*Xj ) Is  obtained  by 

&i.n<Vx0>  * m0,n(x0>t1-fi1,n<xl)>m1,n<xi)C]“60,n<x0^ 
where  min(x)  is  defined  by  (3.2.13)  and  (3.2.14),  and 
Gi,n(x)  * M1,n(x)  ‘ xmi,n(x)  for  a11  0 < 1 < n.  Let 

«*(x)  * {1lAf,r/x1*V  -0)’  then 

{6n^}n=l  are  empirical  Bayes  by  Theorem  3.3.2. 

When  L.|  f L^,  we  need  to  find  consistent  estimators  of 
ja  "»i  (»0)dG0(o0)  and  f*  m^e^dG^e^). 

The  next  theorem  provides  us  with  such  estimators. 

T— o.rem  3.3.1.  Let  M^fx)  and  m<n(x)  be  defined  by  (3.2.13)  and 

(3.2.14),  respectively,  for  all  1 < 1 < k.  If  h > 0,  h - 0,  and 
2 

nh  ■*  « as  n > »,  and  if 


1 2S 


o^G^e^)  < <■*  for  all  0 <_  1 .<  k,  then 

“ £ xm1,n(x)dm0,n(x)  - £ m1(x)dG0(x)  In  (p) 
for  any  a > 0. 

Proof:  /a  »,t„(x)dm0n(x) 

1 i n n f® 

* TIT  A I xIrx  x+hi<Yn>«rv  hv  \(x) 

n fr  j=l  e=l  Ja  u*x+hJ  >J  LYo*'h,W 


l l n n 

77  I I (Ujf*V-  ),  where 

ir  j=l  e=l  je 


V = (YOe'h)I(a,»)(Yorh)I(Yorh,Y0t](Yij)’  and 


Vjt  = Y0tI(ai-)(Y0t)I(Y0tiY0t+h](Y1j)* 


Now,  since  Y0!~M0(x)  and  Yij~Mi(x)  for  all  l<j,  t < n, 
we  have 


E [a  x™,,n(x)*»0>n(x) 

r i /*+h  i 

" Ja  x F Jx  dMi (y)  ^ [m0(x+h)-m0(x)]dx.  Also,  because 

1 fx+h  ^ fx+h  r i 

Fjx  dM1<y>  - K Jx  i-  dG,(e)dy 

1 /-x+h  ,co 


(3.3.3) 


so, 


V 


I*  R I *1 <»>  K ["VJ(»*h)-^0(x)]|  < 1 fX*h  i dtye). 

^ * X 


hence  by  LDCT,  we  get 


lim  E £ *"i  ,„(*)<% in(x)  = I*  xm^x^Mdx 
" - f"  "^(xJdGgU). 


(3.3.4) 


Var  f xmj(x)dmg(x)  = Var  -j  -1  £ (U.  -V  ) 

Ja  n tr  j,e 

= “O  { J Var(Ujt‘Vji)+  ^ I Cov(ui.  "vi«  *uif  “v<» 
n h j,*.  J J j=l  ^*1  ^*2  ^*2 

n 

+ J l Cov(ui  rvi  f*ui  rvi  »)> 
t=l  J1fj2  ^1*  ^2*  **2* 


-jJj  Var(U„-V„)  * Co»(Un-»),.ll12-VI2) 


♦^ic°'(#,rWV 


(3.3.5) 


but  VarlU,,-#,,)  < E[(U„-V„)2]  - E(uf,)  ♦ £(»*,),  ,„d 


i i /x+h 

h E(u1i)  s ja  X KJ  ^(yJdMgCx+h) 

£[,!>'’-V,|]LXw 


I I 
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£ f [1-G^(x)]dx 
Ja 

< f*  edG . (e ) , by  (3.3.3) 

J0  1 

1 2 

so  frECUj,)  is  bounded  for  all  h.  Similarly,  we  can  prove  that 
E(V^)  < 9d6o(0),  hence 

l Var(UirVn)  < 2 | edG0(e).  (3.3.6) 


Meanwhile, 


Cov(U^*V^i  tU^'V^)  s Cov(U^  .U^)  ^ Cov(V'j^  ,1/^) 
- Cov(UlltV12)  - Cov(V11,U12), 


and 


l4cov(Un,U12)|  i-7  {E(U11'U12)+E(U11)E(U12)}  1 20-H,(«)). 

h h 

Similarly,  we  can  prove  that  2(1-M^(a))  Is  also  an  upper  bound  for 

I— Jr  Cov(V11  ,V12) I . |-j  Cov(Un,V12)|  and  |— Cov(V1 1 .U]2) | . Hence 
h h h " 

1 4 Cov(U1 1 -V1 } ,U-| 2-V1 2) | <.  8(1-M1(a)). 


Finally,  | -J-  Cov(U1 , -V]  1 ,U21  -V21 ) | < 80-M^a)). 
Hence,  from  (3.3.5)  we  get 

,ar  rxM1.nWd,W),)->0 


(3.3.7) 

(3.3.8) 
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if  nh  and  h +0  by  (3.3.6),  (3.3.7)  and  (3.3.8).  The  fact 
that  the  variance  goes  to  0 and  the  expected  value  converges  to 

- nij (x)dGg(x)  as  shown  In  (3.3.4)  implies  that 

»«  (p) 

by  Chebyshev's  inequality.  This  finishes  the  proof. 

Recall  that  the  Bayes  rule  Is 

**(£)  ■ Hlx,  i*0  and  4^G((x0.k,)  < 01 

Mf|x,  < x0  and  »|0G((Vl>i0> 

- sfW  u 

Wh6re  AG0.G,(x0'xt)  Jnd  ‘G0,G,<x0,xi)  defined  by  (3.3.1)  and 

(3.3.2)  respectively.  Not,,  Theorem  3.2.1  has  a similar  version  for 
oQ  unknown. 

Ihgorem  3.3.2.  Assume  that  edGf(e)  < - for  all  0 < 1 < k.  If 
for  all  1 < 1 < k,  A]tn(x0,xf)  ^(Xq.x,)  In  (p)  for  x,  > V 

and  A-| ,n(x0*x1 ) " AG^,GQ^X0’X1^  *n  (p)  for  < *q*  Then  **  let 

*W  - <*IX1  > x0  and  A{in(*0,x1)i0) 

u {iJx1  < xo  and  Al,n(x0*xl)  i0) 

us^(&). 
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we  have  {«*(x))“=j  Is  empirical  Bayes. 


Proof: 


1 k 2 k 

* I Ag  - (^niXi)  n m.(x.)  + £ A*  £ (xniXj)  n m.(x.) 

Its^(x)  Gi,G0  0 1 j«l  J J i*s*(&)  GrGo  0 1 j=l  i j 


and 


/bL(g.,«;(x))f^(&)d£(a) 

1 ^ o k 

* l ag  g (xo»xJ  n Mx<)  + I - (xn,xj  n m.(x.). 

*< w 10  i;: J 1 10  j;i 

Now,  following  the  same  method  as  In  the  proof  of  Theorem  3.2.1,  we  can 
show 

, k k 

\ 4g,  ,s0(xt  •xo)  ",  * 1 “g,  ,(L1(xrxo>  ", 

<«;<*>  ' 0 j;l  ««:<*)  ’ 0 j;! 

In  (p)  for  t ■ 1,2.  Hence  by  Lemma  3.2.2,  (6*(x))|^j  Is  empirical 
Bayes.  This  completes  the  proof. 

Now,  let 

‘t.n'VV  ' Wj.  x"t.n(x)'*V),n(x) 

X1 

* f x'to.n(x>a"t,n(x)HlI'-G),r,(x(»0,a(x0) 


* m1  ,n*x1  ^L2^LrL2^60,n^x1  *“Ll60,n(x0^ 


(3.3.10) 


k 


j 


! 
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and 

al.n(x1  -V  = lL2-Ll>[t  m1 

x0 

* f xVn<x>d%n<x>>4^0.n(x0>>1.n(x1> 
x0 

- m0,n(x0)[Llt<L2-Ll'G1,„(x0>-L26i,n(x1>J-  (3.3.11) 

then  by  Theorem  3.3.1,  the  conditions  of  Theorem  3.3.2  are  satisfied. 
Hence,  (3.3.9),  (3.3.10)  and  (3.3.11)  define  a sequence  of  empirical 
Bayes  rules. 

3.4.  Generalization  and  simulation 

Let  p.j(x)  be  a positive  continuously  differentiable  function 
which  is  defined  over  (0,»)  for  all  1 < 1 < k.  Also  let 
-1  f 9 

ci<o)  = J P^xjdx  for  e > 0,  then  f^xfe)  = p1  (x)c1  (e)I(0  0j(x) 

is  a density  function.  In  this  section,  we  assume  that  population 
n1  ~ fi(xlei)  for  all  1 < i < k.  Under  the  formulation  of  Section  3.2, 
we  wish  to  find  the  empirical  Bayes  rules  for  these  more  general 
density  functions.  For  simplicity,  we  assume  that  * L and 

that  eQ  Is  known.  Also,  we  assume  that  G^e)  has  a continuous 
density  g1 (0)  with  a bounded  support  [O.c^],  and  Is  known  for 
all  1 < 1 < k.  Now, 

mi (x)  - fi(x|e)dGi(e)  - p1  (x)  J c^eJdG^e). 


If  we  follow  the  same  discussion  as  In  Section  3.2,  we  can  show  that 
the  Bayes  rule  <s*  Is:  It  <S*(x)  Iff 


0)  xt  i oQ,  or 


a1  a1 

(11)  x.  < eQ  and  0Q  | Cf(x)dG,(x)  < | 

•'x^  ^xi 

Hence,  we  find  It  <S*(x)  Iff  x^  > 0Q-d^ 
fa1 

Jd  (•0-x)c1(x)dG1(x)  • 0.  Let 


xct(x)dG.(x). 

where  d^  satisfies 
(Yn  »•  • • »Yin)  be  a con- 


sistent estimation  of  d^ , then  <sj|(x)  = { i | x . > e0-d.  n)  defines  a 
sequence  of  empirical  Bayes  rules,  and  these  are  (weak)  admissible  in 
the  sense  that  6®( * ,Yj .• • • .J^)  is  an  admissible  rule  for  the  non- 
empirlcal  problem  for  all  V, »•••»!„  *”<1  n (see  Houwelingen  (1976), 

^1  'tI 

Meeden  (1972)).  However,  to  find  such  a sequence  {d<  )“  , is  very 

1 %n  fi“  I 

difficult,  hence  In  view  of  Theorem  3.2.1,  the  more  practical  way  to 
find  the  empirical  Bayes  rules  Is  to  estimate 


Lenina  3.4.1.  Let  p^(x)  and  G^(x)  be  defined  as  above.  If 

m.  „(*)  is  defined  by  (3.2.14)  with  h ■>  0,  nh  -*■  »>,  then  we  have 
i »n 

(“1  xpj(x)  /“I  ,“i 

j 7^7  "1  .»(x)dx  - I pJlxT  *1  ,«<x>  * J xc,(x)d6,(x) 

In  (p). 


Proof:  Since 
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xpj(x)  f°1  xpj(x) 

—5 — m.  (x)dx  •*>  — i — m.(x)dx  In  (p). 

Jx.  p‘(x)  t»n  Jx.  1 


PfCx) 


Since 


;a1  rai  j m.(x) 

J,  _«,<*>*,(*>■  Jx 


/“»  xpj(x)  ,at 

— 5 — m.  (x)dx  - 
Jx.  d‘(x!  1 J 


the  proof  Is  completed. 


Now,  let 


A1,n(V 


°Om1  n^x1  J f*1  x f*1  XP^X^ 

■wW,(  i^Tdni."(x)-L)  ^"'."<x),ix- 


(3.4.1) 


then 

«*(x)  ■ (1|x1  i e0)o{1|xi  < eQ  and  aJ^Xj)  < 0) 
defines  a sequence  of  empirical  Bayes  rules. 

Empirical  Bayes  rules  are  useful  only  if  we  can  control  the  rate  of 
convergence.  Johns  and  Van  Ryzln  (1971,  1972),  Houwellngen  (1973,  1976), 
Van  Ryzln  and  Susarla  (1977),  and  Gilliland  and  Hannan  (1977)  have  de- 
rived theoretical  upper  bounds  for  rn(J5,«*)  - r(£)  under  very  general 
assumptions.  Applying  Lemma  3 of  Van  Ryzln  and  Susarla  (1977),  we  get 


. • nut*. V*. . A st*  - “L-.. A * ■*£***+>'•+ 
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f°  1 

Lemma  3.4.2.  Let  Ag  (x)  » j (oQ- t )c^ ( t )dG^ ( t) I a j(x),  then 

lx  1 

0 i r (£*«„)  ■ r(&)  “ l <[  Iag  (x)  < 0]dx 

n n ial  jh  b1  i i,n 

+ f |&G  (x)pi(x)|P[A*>n(x)  > 0]dx>, 

* ^2  1 * 

where  a?  (x)  and  6*  are  defined  by  (3.4.1)  and  (3.4.2), 

1 % n n 

respectively,  and  Hj  = {x|x  < 0Q  and  Ag  (x)  > 01  and 

h2  = *xlx  - eo  ancl  ag  < Now‘  0(®n)  denote  a quantity 

0(«j 

such  that  0 <_  lim  < ®,  Then  since  |ap  (x)p.(x)|  < M.  for  some 

n-«  n u.  1 - 1 

constant  , so 

- r(G)  < H,(|  Pt4*„(x)  < 0]dx 

* L PU,*n(x)  lOJdx). 

H2 

Therefore,  if  for  all  x > 0 and  n -*■  », 

P[|Ai*n(x)-AG^(x)|  > |a6^(x)|]  » 0(an), 

then 


How,  by  the  Inequality 


PClAi*  (X)-A6  (*)l  > UG  (X)|]  < --jiH(X.)3-  , 

1,n  Gi  ’•[|AGi(x)H46i(x).Eat*n(x)|]< 


. „ 


rn(fc4!l>-r<S>  * °<an>- 


In  the  last  part  of  this  chapter,  we  let  X<  ^11(0,^)  for 
* * eo  *s  treated  *s  unknown.  Assume  that 

20 

9^0)  ■ I(0,c)(9)  for  1 * 0*1  and  L]  ■ l2  x By  Monte  Carlo 

studies,  we  determine  the  smallest  sample  size  N such  that 


Relative  error 


|r.(B,**)-r(8)| 


e 


for  N-4  < m < N.  The  values  of  N corresponding  to  selected  c and 

4 -i 

c are  shown  In  Table  III .1  for  h = n \ Table  III. 2 for  h = n 5, 

-1 

and  Table  III. 3 for  h = n 6,  where  h Is  used  to  define  (3.2.14). 


. 


TABLE  III 


Lists  of  values  of  the  smallest  N such  that 
-777-1 < e for  N-4  < m < N,  where  the 


density  of  priors  is 


" — " means  that  N > 400  (Monte  Carlo  study  was  curtailed 
because  of  limited  resources). 


SEjl 

BTiTl 

BP  >BB 

BBTJ 

SH 

,b 

BITS 

" means  that  N > 400  (Monte  C 


because  of  limited  resources). 


study  was  curtailed 
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